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FEATURES 



'x\.Eo 

--creature ^ToiT^^ 
mtsc_featur e iott^ll ^ 

-sc_£ eat ura 7 / ?n e ";'594 B81 " din9 re3ion " 

/note="Mo2 codij 



^ G I TAAAAAATGA GCTGATTT^ 

^TTCAGGTGGCACTTTTCGGGGAAA^^ 

TTCAA ATATGTATCCGCTCATGAaW^ 
A ^ ATTCATATC *GGA^ 



Nucleotide Sequence of TbF14

cactccgctatcgctacgtg; 



GCCCTGACG^CTTGT^ 

CATGTGTCAGAGGTTTTCACCGTCATCACCGA^ 
^SGTCGTGAAGCGATTCACAGATGTCTGC^ 

TGATGCCTCCGTGTAAGGGGGATTTCTGTTCATGGGGOTAt?^? 0 ^ 
GCTC^CGATACGGGTTACTGATGATGAA^ 

GGCGGTATGGATGCGGCGGGACCAGAGAAAAATCACTCArrr^^^ ^ 

AGATGTAGGTGTTCCACAGGGTAGCCA^^IS^ 

GGGCGCTGACTTCCGCGTTTCCAGAC^^^^^ 

CTAACCAGTAAGGCAACCCCGCCAGCCTAGCCG^ 
-CGAAGGGTTGAGCGAGG^^ 

rCGCCGAAAATGACCCAGAGCGCTGC 
iTAAGTGCGGCGACGATAGTCATGCC 
AGGGCATCGGTCGAGATCCCGGTGC 

AATCGGCCAACGCGCG^ 



CCGTGGGGCCGCCATGCCGGCG^iGSJc^ 
GACGAAGGCTTGAGCGAGGGCGTGCAAGATTCCGAa™^^ 

gctccagcgaaagcggtcctcgcc^a^^ 
~atcataaagaagacagtcatSg?^ 

3ACTGGGTTGAAGGCTCTCAAGGGCA^^ 

^ttaattgcgttgcgctcactgcccgctttccag?cgg CCGG ^ 
^gccaacgcgcggggagaggcggtSg^ 

gtgagacgggcaacagctgattgcccttSccgcctg^^ 

cgctggtttgccccagcaggcgaaaatcctgtttgatggtggt 
tgtcttcggtatcgtcgtatcccactaccgaga^^ 
tggcgcgcattgcgcccagcgccatctgatcgttggca^^ 
cattcagcatttgcatggtttgttgaa^ 

aacttaatgggcccgctaacagcgcgatttgctggSa^ 
gtcgcgtaccgtcttcatgggagaaaataI?aSot?gS 

taatgatcagcccactgacgcgttgcgcSgSS?^gS^^ 



G 



cgcttcgttctaccatcgacaccaccacgctggcacccag^^^ 

cga caatttgcgacggcgcgtgcagggccagac^p C ^J gatcg gcgcgagatttaatcgco 
tscccgccagttgttgtgcJSg^ 

TTTCCCGCGTTTTCGCAGAAACGTGGCTGG^ 
CAGCGGCATACTCTGCGACATCGTATAAr 0 ?^™!^ 



cttccc^cgctatcatgccaS^ 
cgctctcccttatgcgactcctgcatt^^ 

G CCGCCGCAAGGAATGGTG<^TGCAAG^ 

accatacccacgccgaaacaagcotStcag??^ 

ATOiaMO^TATAOQa^ 



gcgtagaggatcgagatctcgatcc^ 

ATAACAATTCCCCTCTA^ 
CCACCATCACCAOUT^^ 
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GTCGTCGCCGACCTGACCCCGCAGAACCAAGCTCTGTTGAACGCCCGCGACGAGCTGCArrrrrar 

ATCGACAAGTGGCACCGGCGTCGGGTGATCGAGCCCATCGACATGGATG?^ 

ACCGAGATCGGCTACCTGCTTCCCGAACCTGATGACTT^ 

GAGATCACCACGACCGCCGGCCCCCAGCTGGTGGTGCCGGTGCTCA^CGC^^ 

ga?gSc^ CGCTCGCTGGGGCTCCCTCTACGACGCCTTOT ^^ 



GA 2^^^ 

\GTG 
3TTC 

:atc 

3ATC 



rCAGTTC 
tCACATC 

C ^^ CCGCGAraCCACGAT ^TGGAC^CG^ 



GCCGGCTACACCGGCGCAGCCGAGTCGCCGACATCGGTGCTGCTAATCAAT^CGGTTTPrAraTn 
GAGATCCTGATCGATCCGGAGTCGCAGGTCGGCACCACCGACCGGGCCGGCGT^ 



GACGGCACCGCTTTCCTGCGGGTGCTCAATAGGGACCGGAACTACACCGCACCCGGCGGTGrrrAr 

atcgtcgacactgacggcagcgaggtgttcgaaggcatcatggatgccctattSc^cc^ 

^catccacgggctaaaggccagcgacgtcaacgggccgctgatcW 

tacatcgtcaagccgaagatgcacggtccggccgaggtggcgtttacctgSS^ 

gttgaagatgtgctggggttgccgcaaaacaccatgaagatcggcatcat^Sg^ 
A S A S GGTCAACCTCAAGGCGTGCATG ^ gg tgccgcggaccgcgt^^ 

TTCCTGGACCGCACCGGCGATGAAATCCACACCTCGATGGAGGCCGGCCCGATGGTGC^CA^GCrP 

accatgaagagccagccgtggatcttggcctacgaggaccacaacgtc 0 ^^ 

GGGTTCAGCGGCCGAGCCCAGGTCGGCAAGGGCATGTGGACAATGACCG^ 

gtcgagacaaaaatcgcccagccgcgcgccggggccagcaccgcctggot?cc^^ 
ggg accctgcatgcgctgcactaccaccaggtcgacgtcgc 

AAGCGTCGCGCCACCATCGAACAATTGCTGACCATTCCGCTGGCC^^^ 

gacgagatccgcgaagaggtcgacaacaactgtcaatccatcctcggc?a^g^ 
gatcaaggtgtcggctgctcgaaggtgcccgacatccacgacgtcgcgctcaSg^^ 

gtgcgggccagcttg^gcc^atggcgccgttggtcgatcgacaaaacgcgg^^ 

cgaccgatggcacccaacttcgacgac^gtatcgccttcctggccgcgcaSS^^^ 

gg^cccagcagcccaacggctacaccgagccgatcctgcaccgacgtcgSg^ 

CG ^ C ?SS A ^ 



ca^caccaaggatgtcacgacgttgggcc^^^ 

gccgcgctgggtgccgatgtctgtgaciatctacaccgacgtggacc^catcttca^ 
cgcatcgtgcgcaacgcccgaaagctcgacaccgtgacctt^^ 

tgcggcgc(^c^tgctgatgctgcgctgcgtggaatacgctcgccgccataata A ?c^ 

GTCCGGTCGTCGTACTCGGACAGACCGGGCACCGTCGTTGTCGGA^ 



FEATURES             Location/Qualifiers
     misc_feature    5072..5095
                     /note="His tag coding region"
     misc_feature    5096..5293
                     /note="Ra3 coding region"
     misc_feature    5294..6346
                     /note="38kD coding region"
     misc_feature    6347..6643
                     /note="38-1 coding region"
     misc_feature    6644..8023
                     /note="FL TbH4 coding region"
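The coordinates in the feature table above can be checked for internal consistency by converting them to lengths. The following is a minimal sketch, assuming the locations are 1-based and inclusive (the usual convention for this flat-file layout) and that the region labels are as best read from the table. The names `FEATURES`, `feature_length` and `is_contiguous` are illustrative only and do not come from the source.

```python
# Coordinates as read from the feature table (assumed 1-based, inclusive).
FEATURES = [
    ("His tag", 5072, 5095),
    ("Ra3", 5096, 5293),
    ("38kD", 5294, 6346),
    ("38-1", 6347, 6643),
    ("FL TbH4", 6644, 8023),
]


def feature_length(start, end):
    """Length in nucleotides of an inclusive start..end location."""
    return end - start + 1


def is_contiguous(features):
    """True if each feature starts one base after the previous one ends."""
    return all(
        nxt[1] == prev[2] + 1 for prev, nxt in zip(features, features[1:])
    )


if __name__ == "__main__":
    for name, start, end in FEATURES:
        nt = feature_length(start, end)
        # A coding region should be a whole number of codons.
        print(f"{name:8s} {start}..{end}  {nt} nt  {nt // 3} codons  rem {nt % 3}")
    print("contiguous:", is_contiguous(FEATURES))
```

Under these assumptions each region is a whole number of codons, and the five regions abut one another, spanning 2952 nucleotides (984 codons) in total.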

gttcgccggctttccccgtcaagctcta^SgSc?^ 
acggcacctcgaccccaaaaaacSg^ 

GACGGTTTTTCGCCCTTTGACGTTGGAGTCCACCTTC??^ 

aacaacactcaaccctatctcggtcta^^ 

TTGGTTAAAAAATGAGCTGATTTAACAAAAAT^ 

aatttcaggtggcacttttcgg^ga^^ 

ttcaaatatgtatccgctcatcS^ 

atttattcatatcaggattatca!^^ 

ACTCACCGAGGCAGTTCCATAGGATG^CA^G^TC 

CATCAATACAACCTATTAATTTCCCCTr^rAaaa^?^ TCGGT CTGCGATTCCGACTCGTCCAA 

TGGCAACGCTACCTTTGCC^TGT^ 
AGATTGTCGCACCTGATTGCCCGA^^ 

tgttc^aatttaatcgc^ 

tattactgtttatgtaagcagacagttttattgttStcac^ 
gtaatctgctgcttgc^caaaaaaac^ 

CTACCAACTCTTTTTCCGAAGGTAACTGGCT^ 
ATCCTGTTACCAGTCGCT^^ 
CGAACGACCTACACCGAACTGA^^ 
CC^GGGGGAAACGCCTGGTATCTTTATAGTC^TCTrrrrTT 
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TTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTGGAT 
AACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCGAGCGCAGCGAG 
TCAGTGAGCGAGGAAGCGGAAGAGCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTGTOGTATT 

tc^caccgcatatatc^tgcactctcagtacaatctgctctgatgccgcatag™ 

CACTCCGCTATCGCTACGTGACTGGGTCATGGCTGCGCCCCGACACCCGCCAACAC^CGCTGACGC 

gccctgacgggcttgtctgctcccggcatccgcttacagacaagctgtgac^^^ 
catgtgtcagaggtttc^^ 

gtggtcgtgaagcgattcacagatgtctgcctgttcatccgcgtccagctcgttgagtttctccao 

AAGCGTTAATGTCTGGCTTCTGATAAAGCGGGCCATGTTAAGGGCGGTTTTTTCCTGTTTGGTrAr 
TGATGCCTCCGTGTAAGGGGGATTTCTGTTCATGGGGGTAATGATACCGATGAAACGAGAGA^ 

gctcacgatacgggttactgatgatgaacatgcccggttactggaacgttgtgagggtaI^cSc? 
ggcggtatggatgcggcgggaccagagaaaaatcactc^gggtcaatgccagcgcttcgttaatac 
agatgtaggtgttccacagggtagcc^gcagcatcctgcgatgcagatccggaacataatggtgca 
gggcgctgacttccgcgtttccagactttacgaaacacggaaaccgaagaccattcatcttgSg^ 
tcaggtcgcagacgttttgcagcagcagtcgcttcacgttcgctcgcgt^ 

ctaaccagtaaggcaaccccgccagcctagccgggtcctcaacgacaggagcacgatcatccgca? 

ccgtggggccgccatgccggcgataatggcctgcttctcgccgaaacgtttggtggcgggaccagt 

gacgaaggcttgagcgagggcgtgcaagattccgaataccgcaagcgacaggccgatcatcgtcgc 

gctccagcgaaagcggtcctcgccgaaaatgacccagagcgctgccggcacctgtcctacgagttg 

catgataaagaagacagtcataagtgcggcgacgatagtcatgccccgcgcccaccggaaggagct 

gactgggttgaaggctctcaagggcatcggtcgagatcccggtgcctaatgagtgagctaacttac 

attaattgcgttgcgctcactgcccgctttccagtcgggaaacctgtcgtgccagctgcattaatg 

aatcggccaacgcgcggggagaggcggtttgcgtattgggcgccagggtggtttttcttttcacca 

gtgagacgggcaacagctgattgcccttcaccgcctggccctgagagagttgcagcaagcggtcca 

cgctggtttgccccagcaggcgaaaatcctgtttgatggtggttaacggcgggata^catgagc 

tgtcttcggtatcgtcgtatcccactaccgagatatccggaccaacgcgcagcccggactcggtaa 

tggcgcgcattgcgcccagcgccatctgatcgttggcaaccagcatcgcagtgggaacgatgccct 

cattcagcatttgcatggtttgttgaaaaccggacatggcactccagtcgccttcccgttccgcta 

tcggctgaatttgattgcgagtgagatatttatgccagccagccagacgcagacgcgccgagacag 

aacttaatgggcccgctaacagcgcgatttgctggtgacccaatgcgaccagatgctccacgccca 

gtcgcgtaccgtcttcatgggagaaaataatactgttgatgggtgtctggtcagagacatcaagaa 

ataacgccggaacattagtgcaggcagcttccacagcaatggcatcctggtcatccagcggatagt 

taatgatcagcccactgacgcgttgcgcgagaagattgtgcaccgccgctttacaggcttcgacgc 

cgcttcgttctaccatcgacaccaccacgctggcacccagttgatcggcgcgagatttaatcgccg 

cgacaatttgcgacggcgcgtgcagggccagactggaggtggcaacgccaatcagcaacgactgtt 

TGCCCGCCAGTTGTTGTGCCACGCGGTTGGGAATGTAATTCAGCTCCGCCATCGCCGCTTCCACTT 

tttcccgcgttttcgcagaaacgtggctggcctggttcaccacgcgggaaacggtctgataagaga 

CACCGGCATACTCTGCGACATCGTATAACGTTACTGGTTTCACATTCACCACCCTGAATTGACTCT 
CTTCCGGGCGCTATCATGCCATACCGCGAAAGGTTTTGCGCCATTCGATGGTGTCCGGGATCTCGA 
CGCTCTCCCTTATGCGACTCCTGCATTAGGAAGCAGCCCAGTAGTAGGTTGAGGCCGTTGAGCACC 
GCCGCCGCAAGGAATGGTGCATGCAAGGAGATGGCGCCCAACAGTCCCCCGGCCACGGGGCCTGCC 
ACCATACCCACGCCGAAACAAGCGCTCATGAGCCCGAAGTGGCGAGCCCGATCTTCCCCATCGGTG 
ATGTCGGCGATATAGGCGCCAGCAACCGCACCTGTGGCGCCGGTGATGCCGGCCACGATGCGTCCG 
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TCATCATCATCACGTGA^^ 

ggg cgtggacagcgccggcaagatcacctaccgcatcaagctcgaagtg^ 



G CTGTTCAACCTGTGGGGTCCGGCCTTTCACGAGAGGTATCCGAACG^ 



TCGGAAGGTGATATGGCCGC ^ 
GGCCATGTACCAGGGCACCATCAAAACCTGGGACGACCCGCAGATCGCTGCGCTC^?^ 

gaacctgcccggcaccgcggtagttccgctgcaccgctccgacgggtccggtgIca^cS 

CACCCAGTACCTGTC^GCAAGATCCCGAGGGCTGGGGCAAGTCGCCcS^^ 

cgacttcccggcggtgccgggtgcgctgggtgagaacgggaacggcgg^ 

ggag acaccgggctgcgtggcctatatcggcatcagcttcctcgaccag^^ 

cggcgaggcccaactaggcaatagctctggcaatttcttgttgcccgacgcScaIa^ 

cgcggcggctggcttcgcatcgaaaaccccggcgaaccaggcgatttcgatga^acg^ 

cccggacggctacccgatcatcaactacgagtacgccatcgtcaacaa?cgg?^ 

caccgcgcagaccttgcaggcatttctgcactgggcgatcaccgacggcaa?^ 

cgaccaggttcatttccagccgctgccgcccgcggtggtgaagttgtc^^ 

gatScSgS 

GATCTCCGGCGACCTGAAAACCCAGATCGACCAGGTGGAGTCGACGGCAGGTTCGTTGCArrrrra 

gtggcgcggcgcggcggggacggccgcccaggccgcggtggtgcgcttccSgSgcag^ 

G^GAAGCAGGAACTCGACGAGATCTCGACGAATATTCGTCAGGCCGC^^ 

CGACGAGGAGCAGCAGCAGGCGCTGTCCTCGCAAATGGGCTTTACTCAGTScAt^ 

GGATC^GCAAGAGATTTTGAACAGGGCCAACGAGGTGGAGGCCCCGATGGC^CcS^^ 

CGACAACATGCGGGAATACCTGGCGGCCGGTGCCAAAGAGCGGCAGCGTCTGGCG^CCTCG^ 
CAACGCGGCCAAGGCGTATGGCGAGGTTGATGAC^ 

AOGAACTGTGCAGGCAGAATCGGCCGGGGCCGTCGGAGGGGACAGTTC^ 

gccgagggtggccacggccggtgaacccaacttcatggatctcaaagaagcS 

AACGGGCGACCAAGGCGCATCGCTCGCGCACTTTGCGGATGGGTGGAACACTT^?^ 

GCAAGGCGACGTCAAGCGGTTCCGG(^GTTTGACAACTGGGAAGGCGATCCG^ 

GGCTTCGCTCGATCAACAAC(^CAATG<3ATACTCCACATG^^ 

g^ctcaatatgtcgcgcagctgcacgtgtgggctaggcgggaaS^^ 
ag tcc^ctcgaacggctttacgcggaaaac^ 

ggagtatcagcagac^tcc^agaaggtgctgaccgaatacaacaa<^S^ 
aaacccgccg^ctc^ 

gatccctggcttcctgatgccgccgtctgacggctccggtgtgactcccggta^o^g^Sg? 
cgcaccgatggttccgcctaccggatcgccgggtggtggcctcccgg^ 

gacgtcggctgggcgggaagccgcagcgctgtcgggcgacgtggcggtc^ 
tc^cggtggaggcggcggggtgccgtcggcgccgttgggat^^^^ 
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-GAGGATCGGGCATGGACCGAGGCCGTCP 
3TGAATTCTGCAGATATCCATCACACTGG 

"^CTGCTGCC^CCOCT^™ 
GTTTTTTGCTGAAAGGAGGAACTATATCCGGAT -TAAACGGGTCTTGAGGG 



CGCTCGAGCACCACCACCACCACCACTGAGATCCGGCTGCTAACAAAGCCCGAAAC 



Figure 3 : Amino Acid Sequence of TbF14 

KDVILESAITTIMDFEDSVAAVDAADKVLGYR^ 
GGGQFTLPGRSI^FVRNVGHI^TNDAIVDTDGSEVFEGIMD^^ 

tgsiyivkpkmhgpaevaftcelfsrvedvlglpq™ 

INTCFUJRTGDEIHTSMEAGPMVRKGTMKSQPWIIAYEDHNVDAGI^ 
MADMVETKIAQPRAGAST^^ 

JWAPDEIREEVD^CQSI^ 

tsadvraslermaplvdrqnagdvayrpmapnfddsiafi^ 

dlldiaqqvcpapppreldmlltagerisnalva^ieslgaharsftgsqagvit^t^naki? 

dvtpgrlqtaleegrwlvagfqgvsqdtkdvttlgrggsdttava^ 

sadprivrnarkldtvtfeemlemaacgakvlmlrcveyar^ 

dvpmedpiltgvahdrseakvtivglpdipgyaakvfravarrrrq 

hllpqtsgpppwknwtrsetrsastqllyddhigkvsligagmrs^ 

istsedqrsrccaatpnwtrpwsrcmkrsgsaatrrprctrgrdgrwacq^^^^^^ 



Figure 4 : Amino Acid Sequence of TbF15 

MGHHHHHHVIDIIGTSPTSWEQAAAEAVQRARDSVDDIRVARVIEQDMAVDSAGKITYRIKLFVqp 
KMR PAQ PRCGS KPPS GS PETGAGAGTVATT PAS S P^^LAETGS TLL YPLFNLWGPAFHER^NVT I 
TAQGTGSGAGIAQAAAGTWIGASDAYLSEGDMAAHKGLMN^^ 
KVI^QGTIKTWDDPQIAAI^PGVNLPGTAWPLHRSDGSGDTFLFTQY^ 

gttvdfpavpgalgengnggmvtgcae^ 

A I VNNRQ KD AA r 

!^?Y hfqplppaw ^ sdaliatissaemktd aatlaqeagnferisgdlktqidqvestags 



siqaaaagfasktpanqaismidgpapdgypiinyeyaiWq^WqtlSaflhwa^ 

ASFLDQVHFQPLPPAWKLSDALIATISSAEMKT™*™ -- _ ^ J-irlWAI TDGNK 

LQGQWRGAAGTAAQAAWRFQEAANKQKQELDE I 



TVTVDQQEII^RANEVEAPKADP^ 

TSLRNAAKAYGEVDEEAATALDNDGEGTVQ^^ 

RKLETGDQGASI^FADGWNTFNLTLQGDVKRFRGFDNWEGDAATACEASLDQQRQW^L^ 
AMAKQAQYVAQLHWARREHPTYEDIVGLERLYAENPSARDQILPWAEYQORSE^TF^ 

lepvnppkpppaikidpppppqecwlipgf^ 
aaQltsagreaaalsgdva^^ 

gaalggggmgmpmgaahqgqggakskgsqqedealytedrawtea^ 
TbF15 TbF6



1,045 




Bg^pL635' 
?&429 




W2J82 
iffiQ&lJ 0.317 



0.183 

0.061 

0.066 

0.021 

0.053 

0.114 

0.105 

0.101 

0.080 

0.140 

0.088 

0.081 

0.057 

0.104 

0.221 

0.257 

0.056 

0.184 

0.126 

0.193 

0.058 

0.183 

0.062 

0.134 

0.113 
0.061 
0.298 



0.314 
0.063 
0.142 
0.115 
0.289 
0.238 
0.146 
0.237 
0.071 
0.117 
0.072 
0.089 
0.06 
0.111 
0.241 
0.265 
0.093 
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0.126 
0.092 
0.057 
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0.157 
0.086 
0.414 
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rrCC^l.5eq..T:pd (1 > 1200) Site and Se^tjenca ' .-— >' 

Enzymes : All 515 enzymes (No

Settings : Circular, Certain Sites Only, Standard Genetic Code



CAGGCA 7GAGC AG AGCG T7C A 7CA f CG A 7CC A ACC A 70 AG 7GCC A 7 7G AC GGC 7 7G T AC G A CC 7 7C rGGGG-.-"GG^ ~ACCCA,ACCAAGGGGG 7 A 7CC 7 

100 



■ TGGG f 7GG 7 FCCCCCA TaGC 



'HTCC-1 FL 3 



M S R A F I I D P T I S A I D G L Y D L L G I G I P N Q G G I L

T PAC 7CC TCAC f AGAG7 AC 7 7CGAAAAAGCCC 7GGAGGAGC 7GGCAGCAGCC7 7 7CCGGG 7GA 7GGC ~GG77AGG77CGGCCGCGGACAAA 7ACGCCGGC 

; 1 ■ : ? i = : '■ •■ I i ■ 1 i : 1 = r 200 

AA7GAGGAG7GA7C7CA7GAAGC 7 7 77 i CGGGACC 7CC TCGACCG 7CG 7CGCAA AGGCCCAC fAC CG A CCA A "CCA AGCCGGCGCC TG T 7 TA 7GCGGCCG 



t'rtTCC-] FL 1 



Y S S L E Y F E K A L E E L A A A F P G D G W L G S A A D K Y A G

AAA AACCGCAACCACG7GA A TT 7 "77CCAGGAAC7GGCAGACC7CGA7CG7CAGC7C A 7CAGCC7GA7CCACGACCAGGCCAACGCGG7CCAGACGACCC 

i 1 * i ' ! 1 ' 1 ' ' ' ■ ; = ■ = ' r 300 

77777GGCG77GG 7GCACT7AAAAAAGG 7CC 7 7GACCG i C i GGAGC t AGCAG I uG AG I AG i lGGAC 7AGG7GC 7GG7CCGG 77GCGCCAGG 7C 7GC 7GGG 



'HTCC-1 FL 1 



K N R N H V N F F Q E L A D L D R Q L I S L I H D Q A N A V Q T T

S GCGACA7CC7GGAGGGCGCCAAGAAAGG7C7CGAG77CG7GCGCCCGG7GGC7G7GGACC7GACC7ACA7CCCGG7CG7CGGGCACGCCC7A7CGGCCGC 
ff 1 . CGC 7G7AGGACC7CCCGCGG 77C77 7C C AGAGC7C A AGCACGCGGGCC A CCGAC AC C 7GGAC7GGA 7G ~AGGGCCAGCAGCCCG7GCGGGA7AGCCGGCG 

CO 



100 



•HTCC-1 FL 1 



R D I L E G A K K G L E F V R P V A V D L T Y I P V V G H A L S A A

'"^ C 7 7C C A GGCGCCG 777 7GCGCGGGCGCGA 7GGCCG7AG7GGGCGGCGCGC 77GCC TAC 7TGG7CGTGA A AACGC7GA7CAACGC GAC7CAAC 7CC7CAA A 
_j: GAAGG7CCGCGGCAAAACGCGCCCGCGC7ACCGGCA7CACCCGCCGCGCGAACGGA7GAACCAGCAC7777GCGAC7AG77GCGC7GAG77GAGGAG777 



500 



1 HTCC-1 FL 1 



F Q A P F C A G A M A V V G G A L A Y L V V K T L I N A T Q L L K

7 7GC 7 7GCC AAA 77GGCGG AG 77GG7CGCGGCCGCCA77GCGGAC A 7C A 7 77CGGA7G7GGCGGAC A 7CA7CAAGGGCACCC7CGGAGAAG7G7GGGAGT 

! ! , ; 1 i i ■ ■ i 1 i s i ! : 1 j i r 

AACGAACGG777AACCGCC7CAACCAGCGCCGGCGG7AACGCC 7G7AG7AAAGCC 7ACACCGCC7G7AG7AG 77CCCG7GGGAGCC 7C 77CACACCC 7CA 



600 



'HTCC-1 FL 1 



L L A K L A E L V A A A I A D I I S D V A D I I K G T L G E V W E

7CA7CACAAACGCGC7CAACGGCC7GAAAGAGC777GGGACAAGC7CACGGGG7GGG7GACCGGAC7G77C7C7CGAGGG7GG7CGAACCTGGAG7CC77 

i i 1 ■ 1 i ! ! — < f 1 i i i 1 1 1 i r 700 

AG7AG7G777GCGCGAG77GCCGGAC777C7CGAAACCC7G7I CGAG7GCCCCACCCAC7GGCC7GACAAGAGAGC7CCCACCAGC77GGACC7CAGGAA 



•HTCC-1 FL' 



F I T N A L N G L K E L W D K L T G W V T G L F S R G W S N L E S F

CTTTGCGGGCGTCCCCGGCrTGACCGGCGCGACCAGCGGCTrGTCGCAAGrGACTGGCTTGTTCGGTGCGGCCGGTCrGTCCGCATCGTCGGGCTTGGCT 

, j , j , 1 1 ; 1 i = 1 i i i . i 1 ! 1 — H 300 

GAAACGCCCGCAGGGGCCGAACTGGCCGCGCrGGrCGCCGAACAGCG7TCAC7GACCGAACAAGCCACGCCGGCCAGACAGGCGTAGCAGCCCGAACCGA 



-HTCC-1 FL 1 



F A G V P G L T G A T S G L S Q V T G L F G A A G L S A S S G



Icnday. July 25, 1 9S3 1 0:42 AM ... 0 

TCCtfl.see.mcd, M > 1200) Site and r ^ 



:c r T^Br::':crc:cccGCA rroGGGGCGGG recce 77 77GGGd^Mtcc 

: j ■ 1 i : -r^K—; 



CACGCGGA 7AGCC 7GGCGAGC 7CAGCCAGC 7f^PTGCCC ; CG^-GGCA . i GGGGGCuGG t -^CGG \ « , i GGGC^MtCCGAGCC TCGC Jr iGG 7CC A jr. 

: ; ! ' : ■ ^ T7- i , 1 c 0O 

G 7GCGCC 7A 7CGGACCGC TCGAG 7CGG f CGAACGGGCGGGACCGGCCG 7AACCCCCGCCCAGGCCAAAACCCCCGA ACGGC 7CGGACCGAG 7CCAGGTAC 



■h'TCC-l FL 3 



H A D S L A S S A S L P A L A G I G G G S G F G G L P S L A Q V H
CCGCC TCAAC TCGGCAGGCGC 7ACGGCCCCGAGC 7GA 7GGCCCGG 7CGGCGCCGC TGCCGAGCAGG 7CGGCGGGCAG7CGCAGC 7GG TCTCCGCGC AGGG 

, : ■ : i : = i 5 ' j : : - i j ; 1 > r rOOC 

GGCGGAG TTGAGCCG TCCGCGA 7GCCGGGGC 7CG AC 7 A CCGGGCCAGCCGCGGCG ACGGC 7CG fCCAGCCGCCCG 7CAGCG7CGAC C AG A GGCGCG fC CC 



'HTCC-1 FL 1 



A A S T R Q A L R P R A D G P V G A A A E Q V G G Q S Q L V S A Q G



TTCCCAAGGTATGGGCGGACCCGTAGGCA TGGGCGGCA 7GCACC CC 7C7TCGGGGGCG 7C GAAAGGGACGACGACGAAGAAG TAG 7CGGAAGGCGCGGCQ 

• i i i j '■■ : • ! ■ j i ■ . j- 1 100 

A AGGG TTCC A TACCCGCC "GGGC A TCCGT AC CCGCCG 7ACG7GGGGAGAAGCCCCCGCAGC 777CCC7GC 7GC 7GC7TC7TCATGAGCCT7CCGCGCCGC 



■HTCC-1 FL ' 



S Q G M G G P V G M G G M H P S S G A S K G T T T K K Y S E G A A

r~ GCGGGCAC 7GAAGACGCCGAGCGCGCGCCAG "CGAAGC 7GACGCGGGCGG 7GGGCAAAAGG 7GCTGG 7ACGAAACG7CG7C7AACGGCA7GGCGAGCCAA 

^ : i s ■ i i = i ' 5 i 5 ' i i ! i i i *> 1200 

^ CGCCCGTGAC7TC7GCGGC7CGCGCGCGG7CAGCT7CGAC7GCGCCCGCCACCCG7T7TCCACGACCA7GC777GCAGCAGArTGCCGrACCGC7CGGT7 

ffs 

B HTCC-1 FL > 

A G T E D A E R A P V E A D A G G G Q K V L V R N V V






Monday, July 26, 1999 10:49 AM

TCC1(1-232) Map.mcd (1 > 725) Site and Sequence
Enzymes : 212 of 515 enzymes (Filtered)

Settings : Linear, Certain Sites Only, Standard Genetic Code



A7GCA7CACCA rCACC.i7CAC.-i 7GAGCAGAGCG7 fCA TC-. 7CGA 7CC AACGA rCAG FGCC A 7 7GACGGC F \ '~ i "~ v--- r ,--V , , , _ , . — 



7ACG 7AG7GG 7AG7GG 7 AG TG 7AC7CG 7C "CGCaaG TAGTaGC i"ACG 77GC TAG FCACGG 7 A AC TGCCGAACA 7CC7GGAAGA<~ *CC 1* A * "C 
M H H H H H H M S R A F I I D P T I S A I D G L Y D L



! i- ICO 

77a TGGG 7 



L G I G I P

ACCAAGGGGG 7A7CC 7 7 TAG FCC7CAC7AGAG "AC 7 7CGAAAA AGCCC TGGAGGAGC 7GGCAGCAGCG 777T "G"G"" * Terr -r~r- r~rr~r 

! : ' ! ! : ! : ! 1 : ! ^ ■ ■ : : _ ^r.n 

7GG 77CCCCCA 7AGGAAA 7GAGGAG 7GA 7C 7CA 7GAAGC7 7777CGGGACC 7CC ; CG ACCG TCG rCGCAAAG r - r ~ ~ ir ~ > r ~r -r- < , P -- rr ,-A " 

N Q G G I L Y S S L E Y F E K A L E E L A A A F P G D G W L G S A A

CGACAAArACGCCGGCAAAAACCGCAACCACG7GAAru^ 

CC7G77TA7GCGGCCG77777GGCG77GG7GCACr7AAAAAAGG7CC77GACCG7C7GGAGC7AGCAG7CGAG7AG7CGGAC7AGGrGcfGG7r C GG77G ^ 
D K Y A G K N R N H V N F F Q E L A D L D R Q L I S L I H D Q A N

GCGGTCCAGACGACCCGCGACA7CC7GGAGGGCGCCAAGAAAGG7C7CGAGr7CG7GCGCCCGG7GGCTG TGGACC7GACC7ACAiCCCGG7CG7CGGGC 
CGCCAGG 7C TGC 7GGGCGC TG TAGGACC 7CCCGCGG 77C777CCAGAGC 7CAAGCACGCGGGCCACCGACACC 7GGAC 7GGA 7G7AGGGCCAGCAGC r C^ ^ 
A V Q T T R D I L E G A K K G L E F V R P V A V D L T Y I P V V G

ACGCCCTA7CGGCCGCC77CCAGGCGCCG7777GCGCGGGCGCGA 7GGCCG 7AG7GGGCGGCGCGC 7 7GCC 7AC "7GG 7CG 7GAAAACGC 7GA7CAACGC 

^ 7GCGGGA7AGCCGGCGGAAGG7CCGCGGCAAAACGCGCCCGCGC7ACCGGCA7CACCCGCCGCGCGAACGGA7GAACCAGCAC7777GCGAC7AGT7GCG ^ 
H A L S A A F Q A P F C A G A M A V V G G A L A Y L V V K T L I N A

ijf GAC7CAAC7CC7CAAA77GC77GCCAAA 77GGCGG AG 7 7GG7CGCGGCCGCC AT 7GCGGAC A 7CA 7 77CGGA 7G7GGCGGACA 7CA7CAAGGGCATCC7C 

Ol C7GAG77GAGGAG 77 7AACG A ACGG 7 77A ACCGCC 7CAACCAG CGCCGGCGG 7A ACGC C TG 7AG 7 A AAGCC7ACACCGCC7G7AG7AG77CCCG7AGGAG~^ 00 

T Q L L K L L A K L A E L V A A A I A D I I S D V A D I I K G I L

H ~ GGAGAAG TGTGGG AG 7 TC A TCAC AAA CGCGC 7CA A CGGCC 7GAAAGAGC 7TTGGGAC A AGC 7C ACG GG G 7GGG TGACCGG AC TG TTC TC TCG A GGG TGG T 

Q CC TCTTC A CACCCTC A AG TAG TGTT TGC GCGAG 7TGCCGGACT TTC 7CGA A A CCC7G7TCG AG TGCCCCACCCA C TGG CCTG AC A AGAG AGC TCCCACCA ^ 
G E V W E F I T N A L N G L K E L W D K L T G W V T G L F S R G W

3 CGAACCTGGAGTCCTTCTAAGAA 7TC 

! i ! i H*. 728 

r: GCTTGGACCTCAGGAAGATTCTTAAG 
S N L E S F . E F
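The map printouts pair each top strand with its complementary strand and a one-letter translation under the Standard Genetic Code named in the settings line. The following is a minimal sketch of those two operations. It assumes the first 45 nucleotides of the strand as best read from the printout above (the His-tag codons followed by the start of the insert), so the input string is an OCR-repaired assumption, and the names `CODON_TABLE`, `translate` and `complement` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate in frame 1; '*' marks a stop codon. A trailing partial codon is ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def complement(dna):
    """Base-by-base complement, i.e. the lower strand as printed under the upper one."""
    return dna.upper().translate(str.maketrans("ACGT", "TGCA"))


# Assumed reading of the start of the printed top strand (OCR-repaired).
TOP = "ATGCATCACCATCACCATCACATGAGCAGAGCGTTCATCATCGAT"

if __name__ == "__main__":
    print(TOP)
    print(complement(TOP))
    print(" ".join(translate(TOP)))
```

The printouts mark a stop codon with a period where this sketch prints `*`.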



Aid- fa 



***** July 25. 1999 10:50AM H I CC - 1 L l ^ " 

iTCCl (13^-392) Map.mcd (1 > 351) 3;i2 and 5 .-nee f 
Enzymes : 212 of 515 enzymes (Filtered)

Settings : Linear, Certain Sites Only, Standard Genetic Code



a 7gca 7c acca rc acca tcacga rc 7ggcggac a tca r c aagggca 7cc r cggagaag 7g fgggag r :ca tcacaaacgcc-c ~ca— cggcc 7ga a agagc 

I i : ■ : : i i 5 ! : : ■ : r ICQ 

TACG "AG TGG TAG 7GG 7AG7GC 7ACACCCCC 7G TAG : AG 7 7CCCG 7AGGAGCC 7C 7 7CACACCC 7CA AG TAG 7 C 7 7 7GCGCGAG 77GCCGGAC 7 7 7C 7CG 

H H H H H H D V A D I I K G I L G E V W E F I T N A L N G L K E
77 7GGG AC A AGC TCA CGGGG 7GGG 7GACCGG AC 7G 77C 7C 7CGAGGG7GG 7CGAACC 7GGAG 7CC7 7C 7 7 7GCGGGCG 7CCCCGGC 7 7GACCGGCGCGAC 

f : . ! ■ i i : = ' ! = : = = i ! j i » 200 

AAACCC 7G 7 7CG AG 7GC CCC AC C C AC 7GGCC 7GAC A AG AG AGC 7CCCACC AGC 77GGACC 7CAGG A AG A A ACGCCCGCAGG . 
L W D K L T G W V T G L F S R G W S N L E S F F A G V P G L T G A T

CAGCGGC77G 7CGCAAG7GACrCGC 7 7G77CGGrGCGGCCGG7C7G7CCGCA7CG7CGGGC7 7GGC7CACGCGGA7AGCC7GGCGAG-7CAGCC4GC77G 

■ i ■ ! ' l - • 5 1 1 1 ! i ' i 5 ■ i ~ 300 

G7CGCCGAACAGCG77CAC7GACCGAACAAGCCACGCCGGCCAGACAGGCG7AGCAGCCCGAACCGAG7GCGCC7A7CGGACCGC7CGAG7CGG7CGAAC 

S G L S Q V T G L F G A A G L S A S S G L A H A D S L A S S A S L

CCCGCCC7GGCCGGCA77GGGGGCGGG7CCGG7777GGGGGC7TGCCGAGCC7GGC7CAGG7CCA7GCCGCCrCAAC7CGGCAGGCGC7ACGGCCCCGAG 
i j ■ i ' i ■ i = i i i i >. i : : ; ■- i-QQ 

GGGCGGGACCGGCCG7AACCCCCGCCCAGGCCAAAACCCCCGAACGGC7CGGACCGAG7CCAGG7ACGGCGGAG77GAGCCG7CCGCGA7GCCGGGGC7C 
P A L A G I G G G S G F G G L P S L A Q V H A A S T R Q A L R P R

C7GATGGCCCGG 7CGGCGCCGC TGCCGAGCAGG 7CGGCGGGCAG 7CGC AGC 7GG 7C 7CCGCGCAGGG7 7CCCAAGG "A 7GGSCGGACCCGTAGGCA TGGG 

i : • : ■ ■ i i ' i ' : i i ! ■ i : . "- $QQ 

q GAC7ACCGGGCCAGCCGCGGCGACGGC7CG7CCAGCCGCCCG7CAGCG7CGACCAGAGGCGCG ! CCCA AGGG77CC A 7ACCCGCC7GGGC A 7CCG7ACCC 
A D G P V G A A A E Q V G G Q S Q L V S A Q G S Q G M G G P V G M G

m 

CGGCA7GCACCCC7C 7 7CG GGGGCG 7CG AAAGGGA CGACGACG A AGA AG 7AC 7CGGA AGGCGCGGCGGCGGGC AC 7G A AG ACGCCGAGCGCGCGC CAG 7C 

rr f i 1 1 ! i i i ■- i '■ s : • i > i i — r-8G0 

jfjf GCCG7ACG7GGGGAGAAGCCCCCGCAGC777CCC 7GC 7GC7GC 7TC 77CA 7GAGCC 77CCGCGCCGCCGCCCG 7GAC 7 7C "GCGGCTCGCGCGCGS7CAG 

G M H P S S G A S K G T T T K K Y S E G A A A G T E D A E R A P V

Pj GAAGC7GACGCGGGCGG7GGGCAAAAGG 7GC 7GG7ACGAA AC G7CG 7C 7AACGGCGAA7TC 

. i i . 1 ■ ■ ■• i ' i ' ^ 53! 

^ C77CGAC7GCGCCCGCCACCCG7777CCACGACCA7GC777GCAGCAGA77GCCGC77AAG 

|=| £ A 0 A G G G G K V L V R N V V . • R R E 




/icrdav July 25, 1 93S 10: -3 AM 

,1001(1-129)^0.^0 <1>411) Sitaarc. querca 
All 515 enzymes 
_ Circular. Certain 

ATGCA 



ri i C£_- J- vj- ; ) 



Rags 1 



: .nr/rr.es 



Standard Gan eclc Ocde 



C - 



s a 



AA 



TACCCAAC CAAGGGGGTA'CC TTTAC'CCTCACrAGAGTAC ITCGAAAAAGCCC rGGAGGAGCTGGCAGCAGCGTTrCCCGGTGA TGGC7 
G-TCCCCCATAGGAAATGAGGAGTCArCTCArGAAGcfTrrrCGGGACCTCCrCGACCGrCGTCGCAAAGGCCCACTACCGA 



13'4 



TTArGGGTTGG _ _ 

, p „ Q G G i L Y 3 S L Z Y f E K A L , L A A A F P G 0 G 

fff; 7TAGGTTCGG CCGCGGACAAA TACGCCGGCAAAAACCGCA ACCACG ~GAA T TCCA.GGAACTGGCAGACC TCGATCGTCAGC TCATC ^ 

. G - CGGC GCCTGTTTATGCGGCCGTTrTTGGCGTTGGTGCACTTAAAAAAGGTCCTrGACCGrCTGGAGCTAGCAGTCGAGTAG 
CCAA l H-AA u - rnhvN cfqc LA0L0RQL .I 




□ TGTGGACCTGACCTACATCCCGGTCGrCGGGCACGCCC^ATAG 



m ACACCTGGACTGGATG-AGGGCCAGCAGCCCGTGCGGGATATC 
BJvOLTYlPVVGHAL. 




Sonday. July 25, 1SSS 10:43 AM 
JJOjafcTfiCUaPsi-l 1 >. 1525) Site an d aegjH^- 
^r/mes : • • Ali o o^,- G-« a tic Coda 

. arancs: - in — • " - - . - - r - r 



francs: --> j !?77^^^ 

>— Met /HIS TAG- II Q G G q G A ! ? < G G A rt A 

n H H » H H H T A A - 



1 



1 CO 




- 200 



— " 7T. : G p T a " L G L G V V 0 N N G N G. A 

-3 " G G G 3 * ' 

AU W • \ '* ! " r,r .^r.rr.r.rrfiC TGCACTAG7GGCGCeAGC7GCCGCGAGGCTAGrTGAGCCG3 



irGAGTC r AACGCGTGG'CGGGA'oUbu 1 — ~ ■ - . . , , : ; , 1 : = 

111-, : ■ ^ ^ r „ r r ^^- r c - t , 7GaGAGCCG"AGAGG~GGCCGC TGCAC TAGTGGCGCCAGCTGCCGCGAGGCTAGTTGAGCCG 

TGC TCAGG TTGCGC AC CAGCCC i CG.GAG^GCu.T i^Au _ ^ _ 



± 300 
u3 



TTTTTa p a a s l g . s r g 0 v i t a v 0 g a p « n s a 

r.-irfTi rr r r~G TGACC7GGC AAACC A AG 7CGGGCGGCACGCG7 ACAGGGAACG7G ACA T 

■ rrrrr. -r.r.rr.GACGCCC TTAACGGG CA ■ CA , uC.-, . . ^C, " , ; ■ i i i ' ■■ ; 

AO.GCA.L^ , , , . :,....--,rrrrrrr.r.77ri,-rrCCCrQTGCGCATGTCCCTTGCACiG^ 



ir -- r . v -----i-ir-CGCrTAAcaiuL * . ■ u..-^ ■ — , , , : ■■ ■ 

ACGC.A.CG. ; . _ r _ G CTGGACCGrTTGG r-CAGCCCGCCGrCCGCATGTCCCTTGCAC . u . A 

TGGCGCTACCG CCTGCGCGAArTGCCCGfAG.A^CCA^.CAG.AG Ao^ ^ _ _ 

| . L N G H H P G 0 » iVv T W Q T :< 5 G G T R T G N V T 

fe, 7 ^ .l A ^ ~ ~~ ~ ' ' ~~ ~ r *^''2^ A '^^A^A^GTTCAA~GAGC AGAGCG 7 7C A 7CA 7CGA7C CAACGA 7C AG7GCC A77GACGGC77G7ACG^A 

(^TuGCCGAGG.ACCL^^ , . , _ ^ r7 , r7inrr .r^rAACtGCCGAACA-TGCT 



400 



,-^- rrr . rr ^ArrCCCGGCCGAAi itLi.^i^u^.^ ~ ; ; ; ■ : ' • 

^GCCGAGG.ACCuCCG. ^ f --r«---rr "AC 7CG7C 7CGC AAG7AG 7AGC 7AGG77GC7AG 7C ACGG7A AC7GCCGAAC A-7SC t 

' ] f ---"*•*"■*" T ~~ r:r:i '" l "'"GC ~~ AAGGA TCA i GGA i C i CCAAu • t Ai. t CG _ , 

3 — rTrr^^rb- v'rrG r \ ° > < ~ * •« v 

p GGAAGACCCCTAACCi lAibGGTiGui iCCC-LAi. -, T "I - 



.^500 



500 



:< a L £ 



L A A A F P G 



-i- 700 



0 G W L 0 S A A 0 K Y A G < H R N H 

- rr A rrrrr~ACA7CC7GGAGGGCGCCAAGAAAGG7C7CGAG77CG7GCGCCCGG7GGC7G^7GG^^CT 



^^^^ 



SLl H0aANAVQ.„TR0ll-- 

■ TT ->.-^^^rrTrr.r ft rr.r.nr fi C 3 ATGGCCGTAGTGGGCGGCGCGCTTGCCTAC.TG 



300 



900 



T Y I 



777777777. --«»«»-.-'«----'- 



cnday, July 25, 1399 10: -3 AM 

VTr^hTCCl.-nc^ f1 > 1S2^ 3ita and Secure " f 



1 7CG7GAAAACGC7GA 7CAACGCGAC 7CAAC 7.C^^^^A7 7CC 7 7GCCAAA 77GGCGGAG 7 7GG 7CGCGGCCGC CGGACA7C~7i TCCGA 7G7GG 

* . • ~ . ~ ~ * ~ I CGO 

IAGCAC 7 777GCGAC7AG 7TGCGC 7GAG 7 7GAGGAG 77 "A ACGAACGG 7 7 7AACCGCC TC A AC CAGCGCCGCCGG 7 iACGCC'G "AG 7AAAGCC 7ACACC 

bm |M1__.L I BI H ' ' — — — — 3— i^— — f, ~C C 1 11,1 11 I II I 

v v : < r L I N A T C L (1 X <- L A X L A i L V A A A I A 0 [ ! S 0 V 
f-".rir' 1 rf-"-TC' rrr 7rcGAG A AG TG TGGGAG TTC A TC AC AAACGCGC7C A ACOGCCTG A A AGAGCTTTGGGAC A AGCTCACGGGG"GGG~G AC 

GCrT r rir.7 7rc r G7AGGAGCC 7C77C AC ACCC7C A AG 7AG7G 7 7 7GCGCG AG 77GCCGG AC 7 7 7C 7CGAAACCC 7G7 7CGAG 7GCCCCACCCAC 7G 

A 0 { ( :< G ( L G £ V W £ F I 7 N A L N G L X £ L W 0 X L 7 G W V 7 
CGGAC TGTTCTCrC GAGGGrGGrCGAACCTGGAG7C 

GCC7GAC4AG4GAGC7CCCACCAGC77GGACC7CAGGAAGAAACGCCCGCAGGGGCCGAAC7GGCCGCGC7GG7CGCCGAACAGCG77CAC7GACCGAAC 

GLFS?i GWSNLc5i r rAGVPGL7GA7SGL5QV7GL 

77CGG 7GCGGCCGG7C7G7CCGCA 7CG7CGGGC77GGC 7CACGCGGA 7AGCC 7GGCGAGC 7CAGCCAGC 77GCCCGCCC 7GGCCGGCA 77GGGGGCGGG 7 
AAGCCACGCCGGCCAGACAGGCG7AGCAGCCCGAACCGAG7GCGCC7A7CGGACCGC7CGAG7CGG7CGAACGGGCGGGACCGGCCG7AACCCCCGCCCA 

— hTCC 1 ■ 
3 F G A A G L S A 5 S G L A H A 0 S L A 5 S A 5 L P A L A G I G G G 

1CCGG7T77GGGG GC77GCCGAGCC7GGC7CAGG7CCA7GCCGCC7CAAC7CGGCAGGCGC7ACGGCCCCGAGC7GA7GGCCCGG7CGGCGCCGC7GCCGA ^ 

5gGCCAAAACCCCCGAACGGC7CGGACCGAG7CCAGG7aCGGCGGAG77GAGCCG7CCGCGA7GCCGGGGC7CGAC7ACCGGGCCAGCCGCGGCGACGGC7_ 
^ * TT" J i . ' 1 

: SGFGGL? 5LAQVHAA5 7 RQALRPRA0 G ? V G A ' A A " £ 

^CACGTC GGCGGGCAGTCGCAGCTGGTC^ ^ qq 
CGTCCAGCCGCCCGTCAGCGTCGACCAGAGGCGCG7CCCAAGGGT7CCA7ACCCGCCTGGGCA7CCGTACCCGCCG 7ACG7GGGGAGAAGCCCCCGCAGC 



■hTCC1 



QVGG a S aLVSAQGSQGMGGPVGMGGMHPSSGAS 

AAAGGGACGACGACGAAGAAG 7ACTCGGAAGGCGCGGCGGCGGGCAC7GAAGACGCCGAGCGCGCGCCAG7CGAAGC7GACGCGGGCGG7GGGCAAAAGG ^ 

TTTCCcTGC i GCTG CTTCTTCATGAGCCTTCCGCGCCGCCGCCCGTGACrTCTGCGGCT 

■ hTCC1 ■ 
KGTTT j CK YS£GAAAG7£0AE PA?VEA0AGGGQK 

7GC7GG7ACGAAACG7CGTC7AAGAA77C 

: i 1 i ( — — 1629 

ACGACCA 7GCTT7GCAGCAGA7TC77AAG 

hTCCl U EcoRl | 

V L V R N V V £r 



hursday, July 22, 1 SSS 1 :25 ?M _ H f CC~ L \> f ] ■ i } Pace 

TCCl(-TO.l).mcd (1 > 122S) Site arc! Sa cugr.. V 
Enzymes : 2 of 515 enzymes (Filtered)

Settings : Linear, Certain Sites Only, Standard Genetic Code



CA7A7GCA7CACCA7CACCA7CACA7GAGCAGAGCGr7CA7CA7CGA r ccaacga fcag tgcca 7 7gacqgcT 7g "A CGACC 7 Tt-fGGGGA 77GSAA 7AC 

G7A f aCG TAG 7GG TAG ~GG 7 AG 7G 7 AC 7CG ~C 7CGC A AG 7 vG 7 AGO 7 AGG 77GC 7 AG 

> Met/ HIS TAG » ■ 1 ■ h~CCl 1 



■KrtHHHHHHflSSAFl i 0 ? 7 i S A (OGLYOLLG i G t 
CC A ACCAAGGGGG7 A TCG 7 7 TAG 7CC7CAC7AGAG "AC 77CGA A AA AGCCC 7GGAGGAGC 7GGCAGCAGCG 777CCGGG7GA 7GGC 7GG 77AGG 7 7CGGC 



200 



GG77GG77CCCCCA7aGGAAA7GAGGAG7GA7C7CA7GAAGC77 i 77CGGGACC 7CC TC'ZACCG 7CG 7CGCAAAGGCCCAC 7ACCGACC A A 7CC A AGCCG 

— hTCC 1 ■ 

o N q G G ( L Y S 5 L £ V K A L c E L A A A ? G 0 G W U G S A 

CGCGGACA A A 7ACGCCGGCAAA AACCGCA ACCACG 7GAA 77 777T CCAGGAAC 7GGC AGACC 7CGA 7CG 7CAGC 7CA 7CAGCC 7GA 7CCACGACCAGGCC 

i ■ : 1 H- i i • i • '• = : ' i : : , r 300 

GCGCCTG77TA7GCGGCCG 77 7 7 7GGCG7 7GG7GCAC 77AAAAAAGG 7CC 7 i GAClG i l i GGAGCTAGCa.G 7CGAG7AG7CGGAC 7AGG 7GC7GG 7CCG"G 

AOKYAGKNRNHVN.-FQSLAOLORQLISLtHOQA 

AACGCGG7CCAGACGACCCGCGACA7CC7GGAGGGCGCCAAGAAAGG7C7CGAG77CGrGCGCCCGG"GGCTG7GGACC7GACC7ACA7CCCGG7CG7CG 

; i i 1 ■ ! ' i 1 i ! S i i s i r '400 

77GCGCCAGGTC 7GC7GGGCGC 7G 7AGGACC7CCCGCGG7 7C777CCAGAGC 7CAAGCA CGCGGGCC A CCG AC ACC 7GGAC 7GGA 7G7AGGGCCAGCAGC 

*y N A v Q 7 7 R 0 I L I G A :< :< G L E F V 3 P V A V 0 L 7 Y t P V V 
rV= GGCACGCCC7A7CGGCCGCC77CCAGGCGCCG7777GCGCGGGCGCGATGGCCG7AG7GGGCGGCGCGC77AAGC77GCC7iC77GG7CG7GAAAACGC7 

: : i i i i ! i ' 1 ! : ' : > : ' i i —"500 

ZZ CCG7GCGGGA 7AGCCGGCGGAAGG 7CCGCGGCAAAACGCGCCCGCGCTACCGGCA7CACCCGCCGCGCGAA7TCGAACGGA7GAACCAGCAC77TrGCGA 
O 1 hTCCI ■ I fHindo j | DELETED 

H A L3AAF0APFC A GAM A VVGGALKL A Y ' L" V V 1<" T" L 

5 GA7 r AACGCGAAGCT 7ACTC AAC 7CC7CAAA77GC 77GCCAAA77GGCGGAG 77GG 7CGCGGCCGCCA 77GCGGACA 7CA777CGGA7G7GGCGGACA7C 

La — — j 1 1 1 i ! 1 i ' i s 5 i ■■ i » i i r 500 

^ C 7AG 7TGCGC77CGA A 7GAG77GAGG AG 777AACGAACGG777AACCGCC7CA ACC AGCGCCGGCGG J. A ACGCC7G 7AG7AAAGCC7ACACCGCC7G 7AG 

2 ■ QSLETEQ j \ Hind3 t 'hTCC H 

P ( Na kl7qlli<lla:<la£lvaaa t a o r isovao t 

y A rCAAGGGCA7CC7CGGAGAAG7G7GGGAG77CA7CACAAACGCGC7CAACGGCC7GAAAGAGC7T"GGGACAAGC7CACGGGG7GGG7GACCGGAC7G7 

D ; ; , ! 1 i . i ' ! ' ^ ' i 1 i i i i r 700 

7AG77CCCG7AGGAGCC7C7 7CACACCCTCAAG7AG7G777GCGCGAG77GCCGGAC7 i 7C 7CGAAACCC7G77CGAG 7GCCCCACCCAC7GGCC 7GACA 

- hTCCI ■ 
[KG [LGEVWtFCTNALNGLKELWOKLTGWVTGL 

7CT^7CGAGGG7GGTCGAACC7GGAG7CC77C777GCGGGCG7CCCCGGC77GACCGGCGCGACCAGCGGC77G7CGCAAG7GAC7GGC77G77CGG7GC 

i t i . 1 f 1 ! i 1 i s i 1 i — i i- 300 

AGAGAGCTCCCACCAGC7TGGACC7CAGGAAGAAACGCCCGCAGGGGCCGAAC7GGCCGCGC7GGTCGCCGAACAGCG77CAC7GACCGAACAAGCCACG 

- hTCCI ■ 

FSRGW SNL&SFF^AGVPGLTGA 73GLSQV TGLFG A 

GGCCGG7C7GTCCGCATCG7CGGGCT7GGC 7CACGCGGA7AGCC7GGCGAGC 7CAGCCAGCTTGCCCGCCC7GGCCGGCA77GGGGG CGGG7CCGG77T7 

( | . ■ j | ■ -- j ■■ *■ - i * ■ ■' ■■ - "I • ■ [ ■•■-<■ ! t ', 1 f" 900 

CCGGCCAGACAGGCGTAGCAGCCCQAACCGA&TGCGCCTATCGGACCGCrCGAGTCGGTCGAACGGGCGGGACCGGCCGTAACCCCCGCCCAGGCCAAAA 

hTCCI — — 

AGLSASSGLAHAOSLASSASLPALAGIGGGSGr 




lursday. July 22, 199S 1:25 ?M ? " , 
rCCU-70.1).mcd (?> 12251 Sita and Sscug fl : 

1GGGGC T 7GCCGAGGC 7GGC 7CAGG 7CCA 7 G C^^^^r G i A G 7GGGC AGGCGC 7ACGGGCC CGAGG 7G A 7GGCG'J^^»3GGGC CGC 7 GC C GAGC AGG 7:~ G 

' = ; 5 ! ; ! ! 1 ! 5 " '■■ ' i • ^ I GOO 

GCCCCGAACGGC TCGGACCGAG 7CCAGG "AGGGCGGAG 7 7G.-GCCG 7CGGCGA 7GCCGGGGC TGGAG 7 AC CGGGCC AGCCGCGGCGACGGC 7GG 7 CC AG G 
^ , ■hTCCl — ^ 

GG L PSLAQVH. 4 A5T. ' GAL 3. 3 7 GAAAiQV 

GCGGGC AG "CGCAGC ~GG TC 7CCGCGCAGGG 7 "CCCaaGG 7 A 7 GGGCGGACCCG 7 AGGCA7GGGCG GCA 7GCACCCC 7C 77CGGGGGCG 7CGAAAGGGAC 
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CGCCCG 7CAGCG 7CGACCAGAGGCGCG 7CCCAAGGG 77CCA 7ACCCGCC 7GGGCA i CCG 7ACCCGCCG 7ACG 7GGGGAGAAGCCCCCGCAGC 777CCC7G 
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GGGSQLVSAQGSGGflGG^VGrtGG^HasSGASKG 7 

GACGACGAAGAAG TAG 7CGG A AGGCGCGGCGGCGGGCAC7G A AG A CGCCG AG CGGGCGCC AG TCGA AG C 7GACGCGGGCGG7GGGCAAAAGG7GC 7GG 7 A 
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C "GC "GC TTCTTCA 7GAGCC TTCCGCGCCGCCGCCCG TGACTTC 7GCGGC FCGCGCGCGG TCAGC T7CGAC rGCGCCCGCCACCCGTTTTCCACGACCA T 
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iTCC1(f!)-T,VI2 Map.MPO (1 >J225) Site 3~* Seojsr.ca 
Enzymes : 1 of 515 enzymes (Filtered)

Settings : Circular, Certain Sites Only, Standard Genetic Code

cat a tgc a re acca rc acc a tc acaT'j^u^^sa-j'-^ i rcArcArcGArccAACQArcAGrGCCAi^SR-^LTrajACGACc rrcrGGGGA 

j = i : i i i ! 5 : i ! i ^ : * f— Q| 

G7ATACGTAG7GG7AG7GG7AG7G7AC7CG7C7CGCAAG7AGTAGC7AGG77GC7AG7CACGG7AACrGCCGAACA TGC TGGAAGACCCC 7 
| " hTCCl 1 

H fl H K H H ri H fl 5 « A r I I 0 P T E 3 A 1 Q q L Y 0 L L G 
T TGGAATACCCAACCAAGGGGG TATCC FT TAG TCC TCACTAGAG TAC TTCGA AAA AGCCC TGGAGGAGC7GGCAGCAGCG T T TCCGGG TGA 

: : = 1 ■ ! l i : : = \ ; : ; ; " , 82 

AACC77A7GGG7TGGT 7 CCCCCA TAGGAAATGAGGAGTGA FC TCATGAAGC TTT i 7CGGGACC TCC T CGACCG TCG TCGCAAAGGCCC AC T 



G ! F N Q G G I L Y 5 5 L £ Y F £ X A L E E L A A A F ? G 0 



7GGC 7GG7 7AGG 77CGGCCGCGGAC A AA 7ACGCCGGC AAA A ACCGC A ACCACG TGA A 777 i i 7CCAGGAAC7GGCAGACC7CGA7CGTCAG 

= 1 i i ; ■ i 1 - : » h ! i , 1 : ; 273 

ACCGACCAATCCAAGCCGGCGCC7GTTTA7GCGGCCG77777GGCG7 7GG 7GCAC7 7AAAAAAGG7CCr7GACCG7C7GGAGC7AGCAG7C 

'-"^^ 

GWLGSAAOKYAGKNRNHVNFFQELAOLORG 
C7CATCAGCC7GA7CCACGACCAGGCCAACGCGG7CCAGACGACCCGCGACAAGC77A 7CC 7GGAGGGCGCCAAGAAAGG7C TCGAG77CG 

— s i 1 i ■ i ' s 1 ' : i < i ! i 35*4 

P^GAG7AG7CGGAC7AGG7GC7GG7CCGG7 7GCGCCAGG7C7GC7GGGCGC7G 77CGAA7 AGGACC7CCCGCGGr7C7 7 7CCAGAGC7CAAGC 
?g hTCCl l |Hir,d3 1 Q£LET£D 



\P- L I SL i H 0 Q A N A V G 7 T ,9 0 !< L [ LEGAKKGLEF 

00 

ffl 7GCGCCCGG7GGC7G7GGACC TGACC TAC A TCC CGG7CGTCGGGC ACGCCC 7 A 7CGGCCGCC 7 7CC A GGCGCCG 77 77GCGCGGGCGCG A 7 

m-i i s i i 1 i i ' i 1 5 5 i = ! ^— \ = ^55 

" « ACGCGGGCC ACCGAC ACC 7GGAC 7GGA7G7AGGGCCAGCAGCCCG 7GCGGGA 7 AGCCGGCGGAAGG 7CCGCGGCAAAACGCGCCCGCGC 7A 

s VRP-V.AVOLTYtPVVGHALSAA."QAPFCAGAM 

M . 

r- GGCC~7AG 7GGGCGGCGCGC TTGCC TAC TTGG TCGTGAAAACGC i.GA i CAACGCGACTCAACTCC7CAAA77GC77GCCAAA7TGGCGGAG 

^ J , ; , i . i « i ' i = i 1 i . i i- 5'46 

\Z CCGGC A TC ACC CGCCGCGCGA ACGGA7GAACCAGCAC77TTGCGAC T AG TTGCGC i GAG77GAGGAG777AACGAACGG7T7AACCGCC7C 
DELETED

D 

q AVVGGALAYLVVKTLtNA 7GLLKLLAKLAE 

TTG G TC GCGGCCGCC A T TGCGG AC A TCA T T TCGGATGTGGCGGAC A TCATC A AGGGC A TCC TCGGAGAAGTGTGGG AG TTC A TC AC A A ACG 

i f ; 1 1 1 ! f 1 1 i i 1 1 i 1 1 — i 637 

AACCAGCGCCGGCGG7AACGCC7G7AG7AAAGCC7ACACCGCC7G7AG7AG77CCCG7AGGAGCC7C77CACACCC7CAAG7AG7G777GC 

DELETED

LVAAAtAOtlSOVAOI E KG t L G E V W E F t 7N 

CGAAGC77C 7C A A r GGCC 7GA A AGAGC 7 77GGG AC AAGC 7CACGGGG 7GGG7GACCGG AC 7G77C7C7CGAGGG7GG7CGAACC7GGAG7C 

I • : " , | 1 «-i 1 ! > ! 1 ! 1 i 1 i 1 723 

GC77CGAAGAG77GCCGGAC777C7CGAAACCC7G77CGAG7GCCCCACCCACTGGCCTGACAAGAGAGC7CCCACCAGC77GGACC7CAG 

H P^da l hTCC1 

AKLLNGLKEL.WOKLTGWVTGLFSRGWSNLES 

C77C777GCGGGCG7CCCCGGC77GACCGGCGCGACCAGCGGC77G7CGCAAG7GAC7GGC77G77CGG7GCGGCCGG7C7G7CCGCA7CG 

I , h 1 1 ' 1 ' 1 ■ 1 ' 1 * 1 1 ! ' 3iS 

GAAGAAACGCCCGCAGGGGCCGAAC7GGCCGCGC7GG7CGCCGAACAGCG77CAC7GACCGAACAAGCCACGCCGGCCAGACAGGCG7AGC 

f f A Q VPGL7GA73GLSGV 7GLFGAAGUSAS 

ATG- 



Friday, July 23, 19S9 3:*1 AA/! 

hTCCl'ffn-T.V12 Mac.MFO ( ■ > 1225) Site 1 Secuenca 



7CGGGC 77GGC 7CACGCGGA 7AGCC TCG Z 7CAGCC AGC 7 7GCCCGCCC 7GGCCGGCA 7 TGG^^^bGG 7CCGG 7 77 7G P -G^GC 7 f-^ 1 " 

AGCCCGAACCGAG "GCGCC 7A 7CGGACCGC 7CGAG 7CGG fCGAACGGGCGGGACCGGCCG ^AACCCCCGCCCAGGCC AAA AC r C'^C 1 ^ * A*~g ^ 
hTCC1



0 3 L A 3 S A S L P A L A G i GGG5 



CGAGCC 7GGC 7CAGG 7CCA 7GCCGCC fCAAC 7CGGCAGGCGC 7ACGGCCCCGAGC 7GA 7GGCCCGG ^CGGCGCCGC 7GCCGAGCAGG7C r -G 
GC7CGGACCGAG 7CCAGG 7 ACGGCGGAG 77GAGCCG 7CCGCGA7GCCGGGGC 7CGAC '" A CCGGGCCAGCCGCGG~GACGGC 7CG7fCAGCC 1001 



LAGVHAAS7RGA L S P 3 A Q G P y Q A 



£ G V G 



CGGGCAG7CGCAGC7GG7C7CCGCGCAGGG77CCCAAGG7A7GGGCGGACCCG7AG GCA7GGGCGGCA7GCACCCC7C7 TCGGGGfiCK rrn 
GCCCGrCAGCGTCGACCAGAGGCGCGrcCCAAGGGT-CCATACCCGCCTGGGCArcCGTACCCGCCGrACGrGGGGAGAAGCCCCCGCA^C 



ggsglvsaggsggmggpvgmggmh? 



G A 



AAAGGGACGACGACGAAGAAGTACrCGGAAGGCGCGGCGGCGGGCACTGAAGACGCCGAGCGCGCGCCAGrCGAAGCTGAC^CGGGCGGTG 

^ 777CCC 7GC7GC 7GC 7 7C 7 7C A 7GAGCC 7 7CCGCGCCGCCGCCCG 7G AC 7 7C 7GCGGC "CGCGCGCGG 7CAGC 77CGAC 7GCGCCCGCCAC ' 
hTCC1

m K Q ' ' T:< :< YS£gaaa g "^oaerapveaoagg 

II GGCAAAAGGTGCTGGTACGAAACGTCGTCTAACGGCGAArrc 

^ -h i ! 1 . 1 , ■ * 1223 

CCG7777CCACGACCA7GC777GCAGCAGA77GCCGC77AAG . 
hTCC1 | EcoRI
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Monday, July 25, 1999 3:25 PM

ht(184-392)-H9-ht(1-129).mpd (1 > 2232) Site and Sequence

Enzymes : 3 of 515 enzymes (Filtered)



Settings: Linear, Certain Sites Only, Standard Genetic Code



C A 7A 7GC A ~C AC CA ; CACCA 1 CACGA 7G 7GGCGGACA 7CA 7CAAGGGC a 7CC 7CGGAG A AG 7G fGGGAG 7 7CA 7CACAAA CGCuC TC a ACGGC* 7GAAAG ' 

G7a7ACG7AG7GG7AG7GG7AG7GC7ACACCGCC7G;AG7AG77CCCG7 AGGAGCC7C77CACACCC 7CAAG7AG7G777GCGCGAG 77GCCGGAC 77TC ^ 
Met/His tag | hTCC1 (184-392)

KnHHKHHKOVAOi (KG 1LG£VW£F | "NALNGLX 

AGC 777GGGACAAGC 7C ACGGGG7GGG 7GACCGGAC 7G 77C 7C 7CGAGGG 7GG 7CGAACC 7GGAG "CC 77C 77 7GCGGGCG 7C r CCGGC 77GACCGG~ r: C 

! '• ~ : 1 ! : ! : ! • : 1 i ■ ! — ■ i — . 1 9QQ 

7CGAAAC C C 7G 7 i CG AG 7GCCCCACCC AC 7GGCC 7G AC A AG AG AGC 7CCC A CC AGC 7 "GGACC 7CAGGA AGAAACGCCCGC AGGGGCCGAAC 7GGCCGCG ~" 



hTCC1 (184-392)



SLWQKLTGWVTGLPSRGWSNL-I3FFAGVPGLTGA 

GACCAGCGGCTTG7CGCAAG7GAC7GGC77G77CGG7GCGGCCGG7CTG7CCGCA7CG7CGG GC77GGC7CACGCGGA7AGCC7GGCGAGC7CAGCCAGC 
C7GG7CGCCGAACAGCG77CAC7GACCGAACAAGCCACGCCGGCCAGACAGGCG7AGCAGCCCGAACCGAG7GCGCC7A7CGGACCGC7CGAG7CGGTCG ^ 



hTCC1 (184-392)



TSGLSQV7GLFGAAGL3A3SGLAHA05LASSA3 

7 7GC CCGCCC 7GGCCGGC A 77GGGGGCGGG7CCGG 77 7 7GGGGGC 7 7GCCGAGCC "GGC 7CAGG 7CCA7GCCGCC7CAAC7CGGCAGGCGC "ACGGCCC-" 

1 ' ! 1 ! ! ! ! ! ! = ! ■ ■ i f : i i r ilQO 

" AACGGGCGGGACCGGCCGTAACCCCCGCCCAGGCCAAAACCCCCGAACGGC 7CGGACCGAG 7CCAGG7ACGGCGGAG77GAGCCG7CCGCGA7GCCGGGG 
hTCC1 (184-392)

^ L P A L A G [GGG5G'GG L?SLA0VHAA5TR0ALRP 
^ GAGC7GA7GGCCCGG7CGGCGCCGC7GCCGAGCAGG7CGGCG GGCAG7CGCAGC7GGTC7CCGCGCAGGG77CCCAAGG7A7GGGCGGACCCG7AGGCA7 

jjy 1 ■ ! ; ! ■ ■ I j j j j j • . j _ j , 1_, Zqq 

gj CTCGAC7ACCGGGCCAGCCGCGGCGACGGC7CG7CCAGCCGCCCG7CAGCG7CGACCAGAGGCGCG7CCCAAGGG77CCA7ACCCGCC7GGGCATCCG7A 
Q) 1 1 — — — him fifli.^ — M — — _ 

SJ R A 0 G P V G A A A £ G V G G G 3 Q L V S A Q G S Q G fl ~ G G F V " G M 

HI 

^~ GGGCGGCA7GCACCCCTC77CGGGGGCG7CGAAAGGGACGACGACGAAGAAG7AC7CGGAAGGCGCGGCGGCGGGCACTGAAGACGCCGAGCGCGCGCCA 

* i ' i ' i ! ! i i 1 1 ■ : . 1 f : 1 u QOO 

P== CCCGCCG7ACG7GGGGAGAAGCCCCCGCAGC777CCC7GC7GC7GC77C7TCA7GAGCC77CCGCGCCGCCGCCCG7GAC77CTGCGGC7CGCGCGCGGT 



hTCC1 (184-392)



GGMHPSSGAS!<GT77KKYS£GAAAG7E0A£RAP 

G7CGAAGC7GACGCGGGCGG7GGGC AAAAGG7GC7GG7ACGAAACG7CG7CGAATTCA7GG7GGA777CGGGGCG77ACC ACCGGAGA 7CAAC7CCGCGA 

' ' 5 ! ! ! ' ! ! ' 1 ! = i * i— < i 1 r 700 

CAGC7TCGAC7GCGCCCGCCACCCG7777CCACGACCA7GC777GCAGCAGC77AAGTACCACC7AAAGCCCCGCAATGG7GGCC7C7AG7TGAGGCGCT 



hTCC1 (184-392) | EcoRI | TbH9



VEADAGGGQ KVLVRNVVEP^VCFGALPPE I N S A 

GGA7GTACGCCGGCCCGGG7TCGGCC7CGC7GGTGGCCGCGGC7CAGA7G7GGGACAGCG7GGCGAG7GACC7G7777CGGCCGCG7CGGCGT7TCAG7C 

1 i 1 '' 1 ? •■ i 1 i ' 5 • i 1 ! 1 i 1 i- 800 

CC7ACA7GCGGCCGGGCCCAAGCCGGAGCGACCACCGGCGCCGAG7C7ACACCC7G7CGCACCGC7CAC7GGACAAAAGCCGGCGCAGCCGCAAAG7CAG 
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RM Y AGPGSASLV_AAAQMW05 V A 50LFSAASAFQ S 

GG7GGTCTGGGGTCTGACGG7GGGG7CG7GGA7AGG77CG7CGGCGGGTC7GA7GG7GGCGGCGGCC7CGCCG7A7G7GGCG 7GGA 7GAGCGTCACCGCG 

1 1 • ' ! > ! — ► f ! 1 ■ 1 ■+ i 1 1 i f- 900 

CCACCAGACCCCAGACTGCCACCCCAGCACCTATCCAAGCAGCCGCCCAGACTACCACCGCCGCCGCAGCGGCATACACCGCACCTACTCGCAGTGGCGC 
VVWGLTVGSWIGSSAGLMVAAASPYVAWllSVrA 
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ht(184-392)-H9-ht(1-129).mpd (1 > 2232) Site and Sequence

GGGC A GGCCG AGC 7GACCGCCGC CCAGG TCC^^^GC 7GCGGC GGCC 7 ACG AG AC GGCG 7A "GGGC ~^-CG-jj^fcCCGC r GG TG A 7CGCCG ' r - c ~ 
CCCGTCCGGC 7CGAC7GGCGGCGGG 7CCAGGCCCAACGACGCCGCCGGA 7GC TC TGCCOC£TAC r ~GA r T£ i ~ r ±rr~^^T~rr<~*r~ i ! ' 'OCO 

- - — -^^u^cugc.c i aucggg rc r tgg 

1 ' 1 ■ 1 11 I O'r-9 imiiiii ■■ 

G G A £ L i A A G V R V A A A A y £ T A Y G L 7 7 ? ? p v ( A c N 

G 7GC "G A AC I"G A TQA T TC 7GA TAGCGACCAACC TCT7GGGGCAAAACACCCCGGCGA 7CGCGC "CAACGAGGCCGAA TACGGC GAGA 7G TGGGCCCAAGA 
CACGaCTTGaCTACTAAGACTaTCGCTGGTTCGAGAACCCCGm^ 1 !C0 

* A E L fl t L I A T N L L G 0 «M T P A ( A V ,m r 4 - Y _ » 

~ - ^ G c W A Q 0 

CGCCGCCGCGArGTTTGGCTACGCCGCGGCGACGGCGACGGCGACGGCGACGTTGCTGCC^TTCG ^^r^^r-r.. .„ 

, : , 1 , : : : ■ '^^^•-^CuCAaAiGACC AGCGCGGG 7GGGC 7C 

GCGGCGGCGC TACAAACCGA TGCGGCGCCGC TGC CGC 7GCCG C TGC CG C 7GCAACGA CGGC A AGC TCCTCrrrr^rcr--^ .^-^ 1 ' ~ 1200 

<-i-bt-uuCC i C i AC i ijG i CGCGCCCACCCGAG 
TbH9

AAAi1rGYAAA7 A7A7A7L! ?="c r. 13 - u _ 

- - A • c i 5 A G G L 

CrCGAGCAGGCCGCCGCGGiCGAGGAGGCC.CCGACACCGCCGCGGCGAACCAGrTGATGAA CAArGTGCCCCAGGCGC7GCAACAGCrGGCCCAGCrrd 

GAGCrCGTCCGGCGGCGCCAGCTCC 7CCGGAGGC7G7GGCCGCGCCGC 7 TGG TC A AC 7ACT7GT 7ACACGGGG TCCGCGACG TTG 7CGA C C GGG TC GGG T ' 3 °° 

" uuo 

^t-SQAAAveeASOTAAANQLflNNVPQaLQQL^Qp 

^ CGCAGGGCACCACGCCTTCTrcCAAGCrGgG-GGCCrGrGGAACACGSTCrCGCCGCArCGGrcGCCSArCAGCAACArGGrGrcgATGGCCAACAACCA 

5 GCgTCCCG-GGTgCGGAAGAAGGTTCGACCCACCGGACACCrTCrGCCAGAGCGGCGTAGCCAGCGGCTAGTCgrrGrACCACAGCrACCGGrTGrrGGr 
= "n-uo — ■ — 
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CArGrCGATGACCAACTCGGgTGrGTCGArGACCAACACC-rGAGCrCGArGrrGAAGGgC-TrgC-CCGGCGGCGGCCGCCCAGGCCGTGCA^CCGCG 

GTACAGCTACrGGrTGAGCCCACACAGCTACrGGrTGrGGAACTCGAGCrACAACTrCCCGAAACGAGGCCGCCGCCGGCGGGTCCGGCACGrTTGGCGC 
" 'ToH9 

"SH-NSGVSMTNT LSSflLXGFAPAAAAOAVaTA 

GCGCAAAACGGGG-CCGGGCGArGAGCTCGC-GGGCAGC-CGC-GGGT-CT-CGGG -CrGGGCGGrGGGGTGGCCGCCAACrTGGGTCGGGCGGCCiCGG 

CGCG TTT 7GCCCCAGGCCCGC TA C TCGAGCGA CCCG TCGAGCG ACCCAAGA AGCCCAG A CCCGCCACCCCACCGGCGG TTGA AC CC AG CCCGCCGG AGC C 

TbH9
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A Q N G V R A fl 5 S L G S S L G S S G L G G G V A A N L G R A A S 
TCGGTTCGTTGTCGGTGCCGCAGGCCf GGGCCGCGG 

agccaagcaacagccacggcgtccggacccggcgccggttggtccgtcagtggggccgccgcgcccgcgacggcgactggtcggactggtcgcggcgcc t 1700 



VG5LSVPQAWAAANQAVTPAARALPLT5LTSAAE 
A A GAGGGC CC GGGC AG A TGCTGGGCGGGCTGCCGCTGGGGCAG A TGGGCGCCA GGG CC^ 

rTCrcCCGGGCCCGTCTACGACCCGCCCGACGGCCACCCCGrCTACCCGCGGTCCCGGCCACCACCCGAGTCACCACACGACGCACAAGGCGGCGCrGGG ' 8 °° 

R -GPGaMLGGLPVGQflGARAGGGLSGVLRVPPRP 

TATGTGA TGCCGCA TTCTCCGGCAGCCGGCGATATCATGAGCAGAGCGT TCATCA TCGA7CCAACGATCAGTGCCATTGACGGCTTGTACGACCTTCTGG 

' 1 ' f 1 * ' 1 1 1 j ' i ■ 1 » i 1 h IQOO 

A TA C AC TAC GGCG T A AG AGGCCGTCGGCCG CTA TAG T AC7CG 7CTCGCAAGTAG7AGC 7AGG T7GC7AG7CACGG TAACTGCCGAACA TGC TGG A AG A CC 

TbH9 | RV | hTCC1
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FTC,. to 
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ht(184-392)-H9-ht(1-129).mpd (1 > 2232) Site and Sequence

CGA 7 7GC AA 7ACCC A ACCA AGGGGG 7 A 7CC 7 CC7CAC 7AGAC r AC 77CGAAAAAGCC C 7GCAGGACC 1^^CAGCG7 77CCGGG rCA7 
1 '■ '' '• ! ' = : ! : ! '~ : rr^^K : -. : ' 



CC7AACC 77A7GGG7 7GG 77CCCCCA 7AGGAAA 7GAGGAG7GA "C 7CA7GAAGC " ^"CGGGACC 7CC 7CGACCG TCG TCGC A A AGGCCCAC 7ACCC '' 
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G 77AGG T 7CGGCCGCGGAC AAA 7 ACGCCGGCA AAA ACCGC A ACCACG 7GAA 7 7 77 77CCAGGA AC 7GGCAGACC 7CGA TCG 7CAGC7CA 7CAGCC "'"A 7 r 

CAA 7CCAAGCCGGCGCC 7G 777A 7GCGGCCG 777 77GGCG77GG 7GCAC77AAAAAAGG7CC 7 7GACCG 7C 7GGAGC7AGCAG7CGAG 7AG7CGGAC 7SG 2I °° 

G 5 A A 0 K Y A G X N S N H V N .- .- Q .£ L x Q 0 , Q ( , 3 , , 

CACGACCAGGCCAACGCGG7CCAGACGACCCGCGACA ~CC 7GGAGGGCGCC A AG A A AGG ~C 7CGAG 7 7CG 7GCG CCCGG 7GGC 7G7GGACC7GACC 7A ~ A 

GrGC7GG7CCGG7rGCGCCAGG7C7GC7GGGCGC7G7AGGACC7CCCGCGG77C777CCAGAGC7CAAGCACGCGGGCCACCGACACCrGGAC7GG'7G7 2200 
■ ' hTCCl [' 




H0QANAVQTT3 0 ! L S G A :< :< G L £ F 

TCCCGGrCGTCGGGCACGCCC7A7AAGATATC 

i ■ i 1 1 r— 2232 

AGGGCCAGCAGCCCGTGCGGGA ~A i*TC ~A TAG 

hTCC1 (1-129) | RV

t P V V G H A L . 0 I 
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G7A7 A CG7AG7GG7 AG 7GG 7AG7G7AC 7CG 7C 7CGCAAG 7 AG 7AGC 7 A GG 7 7GC 7AG 7C AC GG TA^C 7GCCGAACA 7GC 7GGAAGA( ~ ~ ~ ' 



Monday, July 25, 1999 2:42 PM

ht(1-149)-H9-ht(161-392).mpd (1 > 2365) Site and Sequence

Enzymes : 3 of 515 enzymes (Filtered)

Settings : Circular, Certain Sites Only, Standard Genetic Code

CATATGCAiCACCATCACCA l .,..LAiL^^KAb-.' J! I LA , CA 7CGA 7 CC AACG A 7 C AG 7 G C CA^^^GCTr g 7ACG ACC . TC fGGGG A 

c i 

1AGACCCCT 

hTCC1 (1-149)

f C 0 P 7 f 5 A i o g L Y 0 L L G 
7 7GGAA 7ACCC AACCAAGGGGG 7A 7CC 7 7 7AC 7CC 7CAC 7AGAG 7AC 7 7CGA AAA AGCCC 7GGAGGAGC 7GGCAGCAGCG 777CCGGG7GA 

AACC 7 7A 7GGG 7 7GG 7 7CCC CCA 7AGGAAA 7GAGGAGTGA 7C7CA7GAAGC77777CGGGACC7CC 7CGACCG7CG 7CGC AAAGGCCCAC 7 

hTCCl M <" nx " 
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7GGC7GG77AGG 7 7CGGCCGCGGACAAA 7 ACGCCGGC A A AAACCGC A ACCACG7GAA 7 777 7 7CCAGGAAC7nGCAGACC 7CGA7C r 7 
ACCGACCAA7CCAAGCCGGCGCC7G777A7GCGGCCG7777 7GGCG7 7GG7GCAC77AAAAAAGG7CC77GACCG7C7GGAGC7AGCAGTC ^ 



hTCC1 (1-149)



GWLGSAAOKYAGKNRNHVNFF Q£LAOLQRa 

C 7CA 7CAGCC 7GA 7CCACGACCAGGCCAACGCGG7CC A GACGACCCGCGACA 7CC7G GAG GGCGCCAAGAAAGG7C7C GAG 77CG7GCGCC 

Q GAG7AG7CGGAC7AGG7GC7GG7CCGG7 7GCGCCAGG7C7GC7GGGCGC7G7AGGACC7CCCGCGG77C777CCAGAGC7CAAGCACGCGG " 
yl hTCCl f« * 

jj L E S L 1 H 0 Q A .N A V Q 7 7 R 0 t L £ G A :< ;< G L E F V R 

GO CGG7GGC7G7GGACC7GACC7ACA7CCCGG7CG7CGGGCACGCCC7A7CGGCCGCC7 7CCAGGCGCCG7777GCGCGGGCGCGA7GGCc"g7 

^ " f * ! 5 1 1 * 1 ! ' 1 ' ^ ■ 1 1-,,.?" !>. ass 

SJ GCCACCGACACC7GGAC7GGA7G7AGGGCCAGCAGCCCG7GCGGGA7AGCCGGCGGAAGG7CCGCGGCAAAACGCGCCCGCGC7ACCGGCA ' 
rd hTCCl '< 

!\ P v A V 0 L T Y [ p V V G H A L 3 A A F Q A P F C A G A (1 A V 

□ AG7GGGCGGCGCGC77AAGC77A7GG7GGA7 77CGGGGCG77ACCACCGGAGa'7CAAC7CCGCGAGGA7G7ACGCCGGCCCGGG77CGGCC 

U 7CACCCGCCGCGCGAA77CGAA7ACCACC7AAAGCCCCGCAA7GG7GGCC7C7AG77GAGGCGC7CC7ACA7GCGGCCGGGCCCAAGCCGG ^ 
3 —hTCCl " ^N^j^^i r 

3 VGGALXLP1V0FGALPP£[N3ARMYAGPGS-A 

7CGC7GG7GGCCGCGGC7CAGA7GTGGGACAGCG7GGCGAG7GACC7G777 7CGGCCGC37CGGCG777CAG7CGG7GG7C7GGGGTC7GA 
I i i « I " f i f i i . 1 , : , 1 , 

AGCGACCACCGGCGCCGAGTC7ACACCC7G7CGCACCGC7CAC7GGACAAAAGCCGGCGCAGCCGCAAAG7CAGCCACCAGACCCCAGAC7 

— LJO " 

SLVAAAQMW05VAS0LFSAASAFQSVVWGL 

CGGTGGGG7CG7GGA7AGG77CG7CGGCGGG7C7GA7GG7GGCGGCGG7C7CGCCG7A7G7GGCG7GGA7GAGCG7CACCGCGGGGCAGGC 

1 ' 1 1 1 rH ' i • -f- 1 i , 1 f 1 , 723 

GCCACCCCAGCACC7ATCCAAGCAGCCGCCCAGAC7ACCACCGCCGCCAGAGCGGCA7ACACCGCACCTAC7CGCAG7GGCGCCCCG7CCG 

T V G S W [ GSSAGLHVAAVSPY VAWMSVTAGQA 

CG AGC 7GA CCGC C GC CC AGG 7CCGGG 7 7GC 7GCGGCGGCC T ACG AG ACGG CG TA 7GGGCTGACGG7GCCCCCGCCGG7G A TCGCCG AG AAC 

' ' * ' * 1 1 » 1 1 * 1 ! i i 1 1 1 819 

GCTCGACTGGCGGCGGGTCCAGGCCCAACGACGCCGCCGGATGCTC7GCCGCArACCCGACTGCCACGGGGGCGGCCAC7AGCGGCTCTTG 
TbH9

EL7AAQVRVAAAAYE7AYGL7VP-PPV I A E N 

fid- II 



Monday, July 25, 1999 2:42 PM

ht(1-149)-H9-ht(161-392).mpd (1 > 2365) Site and Sequence



CG TGC TGAAC TGATQATTC TGA TAGCGAC^^^C TC t "GGGGCA AAAC ACCCCGGCGA "CGCGG T 



GGCCGAA tacggcgaga TG T 




R A £ L fl f L f A f N L L G Q N T « A i A 7 >j £ a £ y c c rt 
GGGCCCAAGACGCCGCCGCGA TG f TTGGC T ACGCCGCGGCG ACGGCG AC GGCG AC GG C GA CG T TGC fGCCG f fCGAGGAGGCGCtGHAP-A ~ 
CCCGGGT FC TGCGGCGGCGC TACAAACCGA fGCGGCGCCGC TGCCGC fSCCGC TGCCGC 7GCAACGACGGC A AGC TCC7CCGCGGCCTCTA 1001 




W A Q 0 A A A 

£ A P £ ft 

GACCAGCGCGGGrGGGCrCCrCGAGCAGGCCGCCGCGGrCGAGGAGGCCTCCGACACCGCCGCGGCGA ACCAGTTGArGAACAATnrnrrr 

C TGGTCGCGCCCACCCGAGGAGC TCG "CCGGCGGCGCCAGCTCCTCCGGAGGC "GTGGCGGCGCCGCTTGGTCAAC fAC TTGTTACACKG 1QS2 

^ 5 A G G L L £ Q A A A V £ E A S 0 T A A A N Q L M N N v ? 

CAGGCGCTGC AACAGC TGGCCCAGCCCACGCAGGGCACCACGCC TTC TTCC AAGC TGGG TGGCC TG~GGAAGACGGTC TCGCCGCAT^GGT 
^GrcCGCGACGrrGTCGACCGGGTCGGGrGCGTCCCGTGGTGCGGAAGAAGGrTCGACCCACCGGACACCr TCTGCCAGAGCGGCGT AGCCA ' 

jJ QA( -GGLACPTQGTT PSSXLGGLWKTVSPHR 
B CGCCGATCAGCAACATGGTGTCGATGGCCAACAACCACATGrCGATGACCAACTCGGGTGrG ~CGATGA CCAACACCTTGAGCTCGATG r T f 
jJlGCGGCTAGTCGTTGTACCACAGCTACCGGTTGTTGGTGTACAGC TAC TGGTTGAGCCCACACAGC TAC7GGTTG TGGAAC TCGAGC TAC A A 

^ S ? I S N il V 3 (1 A N N H i^l S fi 7 N S G V S tl T iN T L S S M L 
^ ^AAGGGCTTTGCJCCGGCGGCGGCCGCCCAGGCCG y^^^^ACCGCGGCGC A AAACGGGGTCCGGGCGA TGAGC TCGC ~GGGC AGC TCGC fG 
J CfTCCCGAAACGAGGC CGCCGCCGGCGGGTCCGGCACGr ■ TGGCGCCGCG i TTTGCCCCAGGCCCGC TAC TCGAGCGACCCG TCGAGCG AC 1365 

3 " 



KGFAPAAAAQAVQTAAQNGV 



^AMSSLGSSL 



GGTTC rTCGGGTCTGGGCGGTGGGGTGGCCGCCAACrrGGGTCGGGCGGCC TCGGTCGGTTCG TTGT CGG TGCCGCAGGCCTGGGC CGCGG 
CCAAGAAGCCCAGACCCGCCACCCCACCGGCGGTTGAACCCAGCCCGCCGGAGCCAGCCAAGCAACAGCCACGGCGTCCGGACCCGGCGCC 1 ^ 



TbH9



GS SGLGGGVAANLGRAASVGSLSVPQAW-AA 

CC A ACCAGGC AG TCACCC CGGCGGCGCGGGCGCTGCCGCTG A CCAGCC TGACCA^ 

GGTTGGTCCGTCAGTGGGGCCGCCGCGCCCGxCGACGGCGACrGGrCGGACTGGTCGCGGCGCCrTTCrCCCGGGCCCGTCTACGACCCGCC 

ANQAVTPAARALPL.TSLrSAA£RGPGGMLGG 
GC TGC CGG TGGGGC AG A TGGG C GCC AGG GCCGG TGG TGGGCTC AG TGGTG TGC TGCG TG T 7CCGCCGCGACCCT A TG TGA TGCCGC A TTCT 

1 1 1 1 f i ! i ! i i ■ 1 f , | , j t - | Q3Q 

CGACGGCCACCCCGTCTACCCGCGGTCCCGGCCACCACCCGAGTCACCACACGACGCACAAGGCGGCGCTGGGATACACTACGGCGTAAGA 
LPVGQMGARAGGGLSGVLRVPPRPYVMPHS 

Fie . /i 

2^ 




Monday, July 25, 1999 2:42 PM

ht(1-149)-H9-ht(161-392).mpd (1 > 2365) Site and Sequence



CCGGCAGCCGGCAAGC77 AC 7CAAC 7CC^^i 7 7GC 77GCC A AA 7 7GGCGGAG7 7 G<^ 7'^r.-^— , r - , r _ _ 



OZCaCATCa 77 7CGGA7G 



GG^CGiLGuCCui i v-GAA i GAG \ i GiuGAG i 7 7AACGAACGG 777AACCGCC7CA AC r AG-r r.rrrr^r r * t. ~ ' 729 

_ r.y C - - » " -^ Ut - uG 1 --•^LC ib I AG i AAAGCC 7 AC 

TbH9 | Hind3 | hTCC1 (161-392)



G :< L i G L L K L L A X L A Z L V A a a 



A C [ {SO 

7GGCGG AC A 7C A "C A AGGGC A 7CC 7CGGAG.- AG 7G 7GGG.AG 7 7CA 7C AC A A AC r - r <"C 7C *rrr~rr-* i 

"i i i ■ i = 1 : ^ ---v, -^il i^AAAG AGCi TTGGGACAAGCrCAC 

ACCGCC7G7AG7AG7 7CCCG7AGGAGCC7C7rCACACCC7CAAG7AG7G7 7 7Grr.cn A G7 7--.— - .r----- ~ 1 ' ^ 1320 

- ^ , " U "-w—-C! , 1LICGAAACCC7G 7 7CGAG7G 

" 'hiCCl (T* - - 

V A 0 f I KG ! L G £ V W > F [ 7 N A |_ N n' I v ~ t 

L tN G L < L W 0 K L T 

GGGG7GGG 7GACCGGAC 7G 77C7C 7CGAGGG7GG 7CGAACC 7GGAG 7CC 77C 77 TQCQQQrr rrrrrrrr 

i 1 i : f : , , . , '^^LbLC^iCCCCGGCi tGA CCGGCGCGACC A GCGGC 

CCCCACCCACrGGCCrGACAAGAGAGCrcCCACCAGCrrGGACCTCAGGAAGAAACGCCCG-"Arrrrrr-^/--r.^^ ' ' '~ 19 1 1 

uG " J kCC* J AACiGGCCGCGC7GG7CGC-CG 

GWV 'GLFSaGWSNLESFFAGV? GLTGArsG 
^GrCGCAAGTGACrGGCrTGTiCGGrGCGGCCGGfCr^ 

AACAGCG.7CAC7GACCGAACAAGCCACGCCGGCCAGACAGGCG7AGCAGCCCGAACCGAG;GCGCC7A7CGGACCGC7CGAG7CGG7CGA 2002 



hTCC1 (161-392)

I L 3 Q V ' ^ L P ^ A A G L S A 5 5 G L A H A 0 3 L A S S A 5 

| ^" < ^ t ~ C ~ < ~'^~ < r < ~ ~^*~^'— TTGGQGGCGGC5TCCGG "TTTTGGGGQCT FCSCCGACSCTC TGGCTCAOG TCCA TGCCT-CSCC TC AAC TCGQCAGGCG €^-7 

S ACGGGCGGGACCGGCCGrAACCCCC GCCCAGGCCAAAACCCCCGAACGGCrCGGACCGAGTCCAGGrACGGCGGAGrTGAGCGG-CCSCGA 2093 

'hTCCl ' <g< 

™" L P A L A G [ G G G 3 G F G G U P S L A G V H A A 5 T R Q A L 
g ^^CCCCC-AGCTGATGGCCCGGTCGGCGCCGCTGCCGAGCAGGrCGGCGGGCAGTCGCAGCTGGrcrcCGCGCAGGG-T CCCAAGGTA 

U 'GCCGGGGCTCGACTACCGGGCCAGCC GCGGCGACGGCrCGrcCAGCCGCCCG-CAGCGrCGACCAGAGGCGCGrCCCAAGGG-rcCAfAC ^ 
hTCC1

| RPRAOGPVGAAAhQVGGQSCLVSAQGSQGri 

^CGGACCCGTAGGCATGGGCGGCA-GCACCCCTCTTCGGGGGCGTCGAAAGGGACGACGACGAAGAAGTACrCGGAAGGCGCGGCGGCGG 

CCGCCTGGGCArCCGTACCCGCCG rACGTGGGGAGAAGCCCCCGCAGCrrlcCCrGCTGC-GCT-CTrCAlGAGCCTTCCGCGCCGCCGCC ^ 
' 1 hTCCl / - |g< 



GGPVGMGGMHPSSGASKG 



T «i<YSEGA A 



GCACrGAAGACGCCGAGCGCGCGCCAGrCGAAGCrGACGCGGGCGGrGGGCAAAAGGrGC-GGTACGAAACGrCGrCTAAC GGCGAArTC 

CGTGACTTC-GCGGCTCGCGC GCGGTCAGCrjCGACTGCGCCCGCCACCCG--TTCCACGACCATGCTTTGCAGCAGATTGCCGCr-AAG 
hTCC1 (161-392)

GTEOA £RAPVEAOAGGGQKVLVRNVV R R I 



Arc ; // 



Monday, July 25, 1999 2:45 PM

ht(184-392)-H9-ht(1-200).mpd (1 > 2^5) Site and Sequence

Enzymes : 3 of 515 enzymes (Filtered)

Settings: Linear, Certain Sites Only, Standard Genetic Code

CA TA FGCA TCACCA TCACCA rCACGA ~G !"G*GC^Sca fCA fCAAGGGCA rCCrCGGAG*AG ~5 ^GGGAGT rC>^B^AAACG£QC "CAACGGCC rC^A^fi 

G TA "ACG TAG ~GG TAG TGG TAG rGC'ACACCGCCTG TAG TAG rrCCCGfi'iGAGCCTC T \C*C*rrrT* ,r-., ------ '' ' ' r 'CO 

, u > /U!IC -,~ : „ U ' — G'-Gi-^. i lUL^GAGTTGCCGGACTrrc 

Met/His tag | hTCC1 (184-392)

» « H H H H H H 0 V A 0 I [ X C I L G £ V w Z F [ f N A L N G L K 

agc rr tgggacaagc tcacgggg tggg "gaccggac tg rrc rc tcg aggg ~gg tcga a cc ~~gag rr-^ ~7Trr-/-~r - 

1 1 : ; . 1 ; : : : ■ ■ ^ " : 1 1 '■jCvjGuC'.; i '.CiCGGC T TGACCGGCGC 

-CGAAACCCTGTTCGAGTGCCCCACCCAcrGGcc-CACAAGAGAGcrcccACCAGC^^^ 200 

hTCC1 (184-392)

■I I W 0 X L T G W V r G L F S S G W S N I e 5 r - , „ 

- - - • * G V ? G L T G A 

GACCAGCGGC TTG TCGCA AG TGAC TGGCTTG TTCGG TGCGGCCCG TC TG TCCGCA TCG TCGGGC T TGGC TCACGCGGA TAGCC T GGCGAGC TCAGCCAGC 

C TGG TCGCCGA AC AGCG TTCAC TGAC CG A AC A A GCC AC GCCGGCC AG AC AGGCGTAGCAGC CCG A A CCG AG TGCGCCTA TCGGACCC TC r AG ~CT 3 °° 
hTCC1 (184-392)

^sglsqvtgl.-gaaglsassglahaoslassas 

TrGCCCGCCC-GGCCGGCATTGGGGGCGGGiCCGGrrTTGGGGGCrrGCCGAGCC-GGCrCAGGTCCA -GCCGCC-CAAC-CGGCAGGCGCTA^r.rrr.- 

AACGGGCGGGACCGGC CGrAACCCCCGCCCAGGCCAAAACCCCCGAACGGCTCGGACCGAG-CCAGG-ACGGCGGAGrrGAGCCG-CCGCGATGCCGGGG 
hTCC1 (184-392)

J L ? A L A G I G G G 5 G f G G L P S L A Q V H A A 5 T R 0 A L s ? 

g GAGCrGA-GGCCCGGrCGGCGCCGC-GCCGAGCAGG-CGGCGGGCAGrcGCAGCrGGTCrcCGCGCAGGGrTCCCAAGGrATGGGC GGACCCGrAGGCAr 

Q3 C TCG AC T A CCGGGCC AGCCG CGG CG A CGGC TCG TC C AGCCGCCCGTC AGC GTCG A CC A GAGGCGCGTC CCA AGGG 'TC C A "ACCCGCC7GGGC A T-CGGfi" S °° 
hTCC1 (184-392)

R A 0 G P -/ G A A A £ Q V G G Q 3 Q t V S A a G S Q G'-rt "G G' P V ' G - M 
T GGGCGGCATGCACCCCTC-TCGGGGGCGrCGAAAGGGACGACGACGAAGAAGrACTCGGAAGGCGCGGCGGCGGGCACTGAAGACGCCGAG CGCGCGCCA 
M C C CGCCG TA CG T GGGG A GA A GC CC C C GC AG CTTTCCCrGCTGCTGCTTCTTC A TG AG CCTTCCGCGCCGCCGCCCGTG AC TTCTGCGGCTCGCGCGCGGT ^ 

«=* j - 

^ G G " H ? 3 S G A S K G T T T X K Y S S G A A A G T £ 0 A £ R A P 
2 G TCG A AGC TGACGCGGGCGGTGGGC AA AAGGTGC TGG TACGAAACG TCG TCG A A ~TC A TGGTGGATTTCGGGGCG TTACCA CCGGAGA TCAACTCCGCGA 

M CAGCTTCGACTGCGCCCGCCACCCGTTTTCCACGACCATGCTTTGCAGCAGCn^TACCACCTAAAGCCCCGCAATGGTGGCCTCTAGTTGAGGCGCT ^ 

hTCC1 (184-392) | EcoRI | TbH9

VEAOAGGCQKVLVRNVVEFMVOrGALPPeiNSA 

GGATGTACGCCGGCCCGGGTTCGGCCTCGCTGGTGGCCGCGGCTCAGATGTGGGACAGCGTGGCGAGTGACCTG-TTTCGGCCGC GTCGGCGTTTCAGTC 

CCTACATGCGGCCGGGCCCAAGCCGGAGCGACCACCGGCGCCGAGTCTACACCCTGTCGCACCGCTCACTGGACAAAAGCCGGCGCAGCCGCAAAGTCAG 
TbH9

RMYAGPGSASLVAAAQMWOSVASOLfSAASAFQS 

GG TGG TC TGGGG TC TGACGGTGGGG TCG TGG A TAGG " TCG TCGGCGGGTCTG A TGG TGG CGGCGGCCTCGCCGTATGTGGCGTGGATG AGCG TC AC CGCG 

— 1 ' ' 1 ' ' 1 1 1 1 1 i i ! 1 i 1 j i 900 

CCACCAGACCCCAGACTGCCACCCCAGCACCTATCCAAGCAGCCGCCCAGACTACCACCGCCGCCGGAGCGGCATACACCGCACCTACTCGCAGTGGCGC 
TbH9

VVW GLTVGSW[GSSAGl_f1VAAASPYVAWrtSVTA 



Monday, July 25, 1999 2:45 PM
ht(184-392)-H9-ht(1-200).mpd (1 > 2^3) Site and Sequence



GGGCAGGCCGAGC 7GACCGCCGCCCAGG rcCG^^^Bt 7GCGGCGGCC 7ACGAGACGGCG 71 7GGGC ~GACGG|^^^CCGCCCG 7GA 7CGCCGAGAAC r 

" " iOCC 



H = i ■ ri i = 1 = 'r— i ^ : j— 



r^CGGGGC 



CCCG7CCGGCTCGAC7GGCGGCGGGrCCAGGCCCAACGACGCCGCCCCA7GC i"C7GCCGCA7ACCCGAC 7-GCCaCGGGGGCGGCCAC 7AGCGGC7C 77GG 

TbH9

G Q A £ L T A A 0 V 3 V A A A .A Y £ 7 A Y G L 7 7 a ? y I A £ N 

G 7GC 7GA AC 7GA 7G A 7 7C7G A 7 AGCGACCAACC7C 7 7GGGGC A A AACACCCCGGCG A 7CGCGG 7CA ACGAGGCCGAA 7AGGGCGAGA 7G 7GGGCCCA AGA ■ 

' i 5 5 ' 1 5 1 5 1 1 ! ' : s = = r I ICO 

CACGAG7TGACTAC7AAGAG7i7CGC7GG 7 7GGAGAACCCCG77 7 7G7GGGGCCGG7AGCGCCAGr7GC7CGGGC7 7A7GCCGC7C7AGACCCGGG7 7G7 

TbH9

RA£LMfL(Ar.NLLGQNr. 3 AiAVN£A£y G£f1WAQ0 

CGCCGCCGCCA7G77rGGC7ACGCCGCGGCGACGGCGACGGCGACGGCGACG7 "GC TGCCG 77CGAGGAGGCGCCGGAGATGACCAGCGCGGG FGGGC TC 

x ! ! • ' 1 ' 5 ! ! ' : ! 5 i ^ 1 — 1 i OHO 

GCGGCGGCGCTACAAACCGArGCGGCGCCGCTGCCGCfGCCGCrGCCGCtGCAACGACGGCAAGCrCCrcCGCGGCCrcrACiGGrCGCGCCCACCCGAG 

■AAAflFGYAAArATATATLLPFEPAPErtTSAGGL - 

C7CGAGCAGGCCGCCGCGG7CGAGGAGGCC 7CCGACACCGCCGCGGCGAACCAG 77GA 7GAACAA 7G 7GCCCCAGGCGC 7GCA ACAGC 7GGC r CAGC r CA 

' ; ! * 1 1 ! ! ; ; i ! ' ■ ' i » 1300 

GAGC7CG7CCGGCGGCGCCAGC 7CC 7CCGGAGGC 7G 7GGCGGC GCCGC 77GG7C A AC 7AC77G 7 7AC A CGGGG7CCGCGACG 77G 7CGACCGGG 7CGGG 7 
TbH9

L £ Q A A AV££AS0TAAANQLi1NN 7 PQALQGLAQP 

D 

yilCGC AGGGC ACC ACGCC 7 7C 77CCAAGC7GGG 7GGCC7G7GG A AG ACGG 7C7CGCCGCA 7CGG7CGCCGA 7CAGCAACA7GG7G7CGA 7GGCCAACAACCA 

' i ' i 5 ' i ' ! ! = ! i ' i 1 ■ i r 1^00 

r^GCGTCCCGTGG 7GCGGAAGAAGG 77CGACCCACCGGACACC 77C TGCCAGAGCGGCG 7AGCCAGCGGC7AG7CG 77G rACCACAGCTACCGGTTGrfGGT 

TbH9

git a G T T P S S K L G G L W X T V 2 ? H R 5 ? [ S N n V S fl A • N.. N' H 

SJ •. • . " 

^: CA TG 7CG A 7GACC A AC 7CGGG7G7G7CG A 7GACCAACACC 7 7GAGC 7CGA 7G 77GA AGGGC 777GC 7CCGGCGGCGGCCGC r CAGGCCG7GCAAACCG'~G 

iy ' i = ! 1 : * i ' ! * i i : * i , ; , ' r , 500 

- G7ACAGC7AC7GG77GAGCCCACACAGC7AC7GG77G7GGAAC7CGAGC7ACAAC77CCCGAAACGAGGCCGCCGCCGGCGGG7CCGGCACG777GGCGC 
TbH9

Q M S tt 7 N S G V S tt 7 N 7 L S 5 rt L X G F A '? A A A A Q A V Q 7 A 

y= 

Q GCGC AAA A CGGGG 7CCGGGCGA7G AGC 7CGC 7GGGCAGC 7CGC 7GGG 7 7C77CGGG7C7GGGCGGTGGGG7GGCCGCCAAC77GGG7CGGGCGGCC7CGG 

^ 1 : 1 i ■ ! 1 \ ' ! i ■ : ! : , : , - r , 600 

CSCG 77 77GCCCCAGGCCCGC7AC7CGAGCG AC CCG7CGAGCGACCC A AGA AGCCC AG A CCCGCCACCCCACCGGCGG77GAACCCAGCCCGCCGGAGCC 

AQNGVRAMSSLGSSLGSSGLGGGVAANLGRAAS 

TCGGTTCGTTGTCGGTGCCGCAGGCCTGGGCCGCGGCCAACCAGGCAGTCACCCCGGCGGCGCGGGCSCTGCCGCTGACCAGCCrGACCAGCGCCGCGGA 
• i 1 i ' ! ' 1 ' i • ! 1 i . i 1 i , l. t70 0 

AGCCAAGCAACAGCCACGGCGTCCGGACCCGGCGCCGGTTGGTCCGTCAGTGGGGCCGCCGCGCCCGCGACGGCGACTGGTCGGACrGGrCGCGGCGCCT 

V G SL 3 V P Q A W A A A N G A V 7? A A R A !_ P L 7 S L 7SAA £ 

AAGAGGGCCCGGGCAGA7GCTGGGCGGGC7GCCGG7GGGGCAGA7GGGCGCCAGGGCCGG7GG7GGGC7CAG7GG7G7GC7GCG7G77CCGCCGCGACCC 

" i f 1 1 i 1 H3 ' i 1 i s 1 1 f 1 f 1 f- 1300 

TTCTCCCGGGCCCGTCTACGACCCGCCCGACGGCCACCCCGTCTACCCGCGGTCCCGGCCACCACCCGAGTCACCACACGACGCACAAGGCGGCGCTGGG 

RGPGGMUGGLPVGGMGARAG-GGLSGVLRVPPRP 

TATG7GA7GCCGCArTC7CCGGCAGCCGGCGA7A7CA7GAGCAGAGCG77CA7CA7CGATCCAACGA7CAG7GCCA77GACGGC7TG7ACGACC7TCTGG 
1 i 1 1 1 j 1 1 1 ( 1 1 i 1 1 , 1 , u 1900 

ArACACrACGGCGrAAGAGGCCG7CGGCCGCTA7AG7ACrCGrcrCGCAAGTAG7AGCTAGGT7GCTAGrCACGGTAACrGCCGAACA7GCTGGAAGACC 

TbH9 | RV | hTCC1 (1-200)

YVMPH5PAAGQ [fISRAF [ [CPfiSAIOGLYOLL 



Monday, July 25, 1999 2:45 PM



GGA 7 7GGA - TACCC AACCAAGGGGG I A 7CC 7 T^^p~C "TCAC FaGAG i AC t i *-i A A A A A u C C C i Gu-GuAu-- »^^^CAGCG 7 77CCGGG 7GA 7GGCt£ 

.. - ~ • • " 2CCO 

CC7AACC 77A 7GGG 7 7GG 77CCCCC A 7 AGG AAA 7G A GG AG 7GA FC FCA i GAAGC i T i 7 ; CGGGaCC 7CC 7CGACCG i"CG 7CGC A A~AGGCCC AC 7ACCGAC 

G [ Q i ? N Q G G ( L Y 3 5 L £ Y r £ X A L E £ L A A A f ? G G G W 

G77AGG77CGGCCGCGGACAAA7ACGCCGGCAAAAACCGCAACCACGfGAA77r77rCCAGGAAC7GGCAGACC7CGA7CG7CAGC rCA7CAGCC f G A 7 C 

1 ■— 1 1 ; ■ • 1 1 5 1 ' i '- r 2100 

CAA7CCAAGCCGGCGCC7G77 7A7GCGGCCG7 7777GGCG77GG7GCACr7AAAAAAGG7CG i 7GACCG7C7GGAGC7AGGAG i CGAG TAG7CGGAC7AG 

hTCC1 (1-200)

L G S A a 0 :< Y A G :< N 3 N H V N F Q € L A 0 L 0 R Q L t S L ( 

"ACGACCAGGC" A ACGCGG7CCAGACGACCCGCGAC A 7CC7GGAGGGCGCCA AG A A AGG 7C 7CGAG 7TCG 7GCGCCCGG 7GGC 7G7GGACC 7GACC T AC A 

. ■ i i i i ! 5 ! ; j 1 ! ! ! i ' i ' r 2200 

G7GC7GG7CCGG77GCGCCAGG7C7GC7GGGCGC7G7AGGACC7CCCGCGG7 7C777CCAGAGC7CAAGCACGCGGGCCACCGACACC7GGAC7GGA7G7 

hTCC1 (1-200)

. HO QANAVQTTRO[L£GA:<KGL£FVRPVAVOLTY 
TCCCGG 7C G 7CGGGC ACGC CC T A 7CGGCCGCC 77CCAGGCGCCG 77 77GCGCGGGCGCG A 7QGCCG TAG 7GGGCGGCGCGC 7 7GCCT AC T7GGTCG 7GAA 



AGGGCCAGCAGCCCG7GCGGGA7AGCCGGCGGAAGG7CCGCGGCAAAACGCGCCCGCGC7ACCGGCA7CACCCGCCGCGCGAACGGA7GAACCAGCAC7 

hTCC1 (1-200)



23CO 



^rpvVGHALSAAFQAP.rCAGArtAVVGGALAYLVVK 

LJ ' 

^AACGC7GA7CAACGCGAC7CAAC7CC7CAAA77GC77GCCAAA77GGCGGAG77GG7CGCGGCCGCCAT7GCGGACA7CA777CGGA7G7GGCGGACA7C 

ff\ f 1 , i ■ 1 • — ' ! i ' 5 * i > i ' r 2400 

CD 77GCGAC 7AG 7TGCGC 7GAG77GAGGAG 777AACGAACGG 777AACCGCC 7CAACCAGCGCCGGCGGrAACGCC7G7AG7AAAGCC 7ACACCGCC7G7AG 

hTCC1 (1-200)

01 - j_ [ lN a 7 0 L L K L L A K L A £ L V A A A I A Q [ [ S 0 V ■ ' A . - G . t 
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Figure • : Nucleotide sequence of MTb59

cacgacCgcccgaccgaacccgaaccagtcagcacaaaccgaactaggaagacgaaaagctatcgc 
cgagucgacaatccccgcigaugacatccagagcgcaatcgaagagtacgtaagctcttccaccgc 
cgacaccagcagagaggaagccggt:accgccgt:cgat:gccggggacggcaccgcacacgtcgaggg 
ctcgccaccggtgatgacccaagagccgciicgaatucccgggcggaaccctcggcgccgcccccaa 
ccccgacgagcacagcgccggcgcggcgacccucggcgacctcgagaacatcgaagaaggccagca 
ggtcaagcgcaccggcgaagcctcaccggc^ccggccggcgacgggtcuctiggggcgggtggttaa 
cccgctcggccagccgat:cgacgggcgcggagacgt:cgaccccgatact:cggcgcgcgct:ggagcc 
ccaggcgccctcggtggugcaccggcaaggcgtgaaggagccgtcgcagaccgggatcaaggcgat: 
tgacgcgatgaccccgatcggccgcggccagcgccagctgaucatcggcgaccgcaagaccggcaa 
aaccgccgcctgcgtcgacaccatccccaaccagcggcagaactgggagcccggugatCGcaagaa 
gcaggcgcgccgugtacacguggccaccgggcagaagggaactaccatcgccgcggtacgccgcac 
accggaagagggcggtgcgatggaccacaccaccatcgtcgcggccgcggcgtcggagcccgccgg 
tttcaaatggcttgcgccgtacaccggttcggcgaccgcccagcactggacgcacgagggcaagca 
tgtgccgatcatcttcgacgacccgaccaagcaggccgaggcataccgggcgacctcgctgctgct 
gcgccgtccgcccggccgcgaggcccaccccggcgatgtgttctatccgcattcgcggcttttgga 
gcgctgcgccaaactgtccgacgatctcggtggcggctcgctaacgggcccgccgatcaccgagac 
caaggccaacgacatctcggcctacatcccgaccaacgtcatctcgatcaccgacgggcaatgttt 
cccggaaaccgacctgttcaaccacggcgtccggccggccatcaacgtcggcgtgtcggtgtcccg 
agtcggcggcgcggcgcagaficaaggctatgaaagaggucgccggaagcGuccgcttggacctttc 
gcaataccgcgagctagaagctcccgccgctttcgcttctgatttggacgccgcatcgaaggcgca 
gttggagcgcggcgcccggccggtcgagctgctcaagcagccgcaatcccagcccatgcccgttga 
ggagcaagtggtttcgatcttcctgggcaccggcggtcacctggactcggcgcccgtcgaggacgt 
ccggcggtccgaaaccgaattactggaccacatgcgggcctccgaagaagagattLtgactgagat 
ccgggacagccaaaagctcaccgaggaggccgccgacaagct:caccgaggtcatcaagaacttcaa 
gaagggcctcgcggccaccggtggcggctctgtggtgcccgacgaacacgtcgaggccctcgacga 
ggataagcccgccaaggaagccgtgaaggLcaaaaagccggcgccgaagaagaagaaatagctaac 
catggctgccacacttcgcgaactacgcgggcggatccgcucggcagggtcgatcaaaaagatcac 
caaggcccaggagctgattgcgacatcgcgcatcgccagggcgcaggctcggctcgagtccgctcg 
gccctacgcttttgagatcacccggatgcttaccaccctggccgctigaagccgcactggaccaticc 

gttgct 




Figure . : Amino acid sequence of MTb59

MAELT I PADD IQS AI SEYVS S FTADTS RSEVGTWDAGDG I AKVEGL PSVMTQELLEFPGGILGVA 
LNLDEKS VGAVI LGD FENI EEGQQVKRTGEVLS VP VGDGFLGRWN? LGQ? I DGRGDVDSDTRRAL 
ELQAP S WHRQG VKE PLQTG I KA I D AMT ? I GRGQ RQLI I GDRKTG KTAVCVDT I LNQRQNWE SC-D ? 
KXQ VRCVYVA I GQ KGTT I AAVRRTLE EGG AMD YTT I VAAAA3 E S AG F KWLA? YTGS A I AQHWMYEG 
KHVLI I FDD LT KQ AEAY'RA I S L 1 L RR ? ? GREA Y PGDVFYLH3RLLE RC AXL 5 D D LGGGS LTGL P 1 1 
ETKAND 1 5 AY I PTNVI 5 ITDGQC 7LETDLFNQGVR? AI NVGVS VS RVGGAAQ I KAMKEVAGSLRLD 
LSQYRELEAFAAFASDLDAASKAQLERGARLVELLKQPQSQPMPVEEQVVSIFLGTGGKLDSVPVE 
DVRRFETELLDHMRASSESILTEIRDSQKLTSEAADKLTEVI KNFKKGFAATGGGSWPDEKVEAL 

DEDKIAKEAVKVKKPAPKXKX 
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ccagcccccgccccgcccacgcccacgracgtiggactgatggccaaagcgccagagaccgaacgcc 
cgggccccggcacccaaccggcggacgcccagaccgcgacgcccgcgacggcccgacccccgagca 
cccaggcggcgt cccgccccgaccccggcgatgaggacaaciiucccccauccgacgctcggcccgg 
acaccgagccgcaagaccggacggccaccaccagccgggcgcgcccgccggccagacggccgggcg 
gcggcctggtggaaaccccgcgggcgcccgacaLcgatccgccugaggccctgacgaccaacccgg 
tggtgccggagcccaagcggccccgccggaaccgcggacgccccgccggcccgtccgactcggaga 
ccaagggagcutcagagggccggCgcccctautgcggcagcccgcatLicgcccctgccgcagctaa 
atcccggggacatcgccgccggccagcacgaggccaaaggccgcatcgcgcacggcggactgggct: 
ggacctacctcgctcccgaccgcaacgccaacggccgtccggtggugGucaagggcGtggtgcatt: 
ccggtgacgccgaagGgcaggcaacggcgacggccgaaGgGcagttGGcggccgaggtggcgcacc 
cgtcgaccgtgGagatctuGaactctgtcgagcacaccgaGaggcacggggatccggccggctaca 
tcgcgatggaacacgtcggcgggcaatcgcccaaacgcagcaagggccagaaactgcccgtcgcgg 
aggccaCcgcccacctgctggagaccctgccggcgciigagccacctgcaticccauGggcttggtct: 
acaacgaGCtgaagcGggaaaacauGatgctgaGcgaggaacagctGaagccgatGgacctgggcg 
cggCatcgcggatcaactcgttcggccacctctacgggaccccaggctcccaggcgcccgagaLcg 
tgcggacGggtcGgacggtggccaGGgacatiGtacaGcgtgggacgGaGgccGgcggGgcccacgG 
tggacGtgcGcacGcgcaauggGcgccacigtggatgggctaGGGgaagacgacccggtgctgaaaa 
cctacgactcttacggccggttgctgcgcagggccatcgaccccgatccgcggcaacggctcacca 
ccgGcgaagagatgtGcgGgcaaccgacgggcgtgttgcgggaggtggucgcGGaggacaccgggg 
tgGGgcggccagggctatcaaGgatGtiCcagtiGccagtGggtcgacatttggagtggacctgctgg 
tggcgcacaccgaGgtigtatctggaGgggGaggtgGacgcggagaagctgacGgccaacgagatGg 
tgacGgcgGtgtcggtgccgctggtGgacGcgaccgacgtiGgcagcttcggccctgcaggccacgg 
tgctctrcccagccggtgcagaccctagactcgctgcgcgcggcccgccacggtigcgctggacgccg 
acggcgtcgacttctccgagtcagtggagctgccgccaatiggaagtccgcgcgccgctggatctcg 
gcgatgtggccaaggccacccgaaaactcgacgatctggccgaacgcgttggctggcgatggcgat 
tggtctggtaccgggccgtcgccgagctgctcaccggcgactatgactcggccaccaaacatttca 
ccgaggtgctggatacccttcccggcgagctggcgcccaagctcgccctggccgccaccgccgaac 
tagccggcaacaccgacgaacacaagttctatcagacggtigtggagcaccaacgacggcgtgatct: 
cggcggctttcggactggccagagcccggtcggccgaaggtgatcgggtcggcgccgtgcgcacgc 
tcgacgaggtaccgcccacttctcggcatttcaccacggcacggctgaccagcgcggtgactctgt 
tgtccggccggtcaacgagtgaagtcaccgaggaacagatccgcgacgccgcccgaagagtggagg 
cgctgcccccgaccgaaccacgcgtgctgcagatccgcgccctggtgctgggtggcgcgctggact: 
ggctgaaggacaacaaggccagcaccaaccacatcctcggtttcccgttcaccagtcacgggctgc 
ggctgggtgtcgaggcgtcactgcgcagcctggcccgggcagcccccactcaacggcatcgctaca 
cgctggtggacatggccaacaaggtccggcccaccagcacgttccaagccgcccgagtgtgaatcg 



Figure - : Amino acid sequence of MTb82



MAKAS ETER5GPGTQ ? ADAQTATS ATVRPL3TQ Av FRPDFGD EDNF? H PTLGPDTEPQDRMATT5 R 
VRP PVRRLGGGLVE I PRAPDI DPLEALMTN'PWPES KRFCWNCGRPVGRSDS STKGAS SGWCPYCG 
SPYSFLPQ^PGDIVAC^YEVKGCIAHC^-LGWIYLAL^ 

RQFLJ^VVKPSIVQIFNFVSHTDRHGDPVGYIVMEYVGGQSLKRSKGQKLPVAEAIAYLLEILPAL 
SYLHS IGLVYNDLKPENIMLTEEQLKLIDLGAV5RINS FGYLYGTPGFQA.PS I VRTGPTVATD I YT 
VGRTLAALTLD L PTRNGR YVDGL PEDDPVLKTYDS YGRLLRRA IDPDPRQ R FTTAE SMS AQ LTGVL 
REWAQDTGVPRPGLSTIFSPSRSTFGVDLLVAI-iTDVYLDGQVKAEKLTANEIVTALSVPLVDPTD 
VAA5 VLQATVLSQ ? VQTLDS LRAARHGALDADGVD FSESVELP LME VRALLDLGD VAKATRKLDDL 
AERVGWRWRLVWYRAVAELLTGDYDSATKHFTEVLDTFPGELJ^^ 

VWSTNDGVISAAFGIJ^ARSAEGDRVGAVRTLDEVPPTSRHFTTARIjTSAVTLLSGRSTSEVTEEQ 

i rdaarrvsalp pte prvlq i ralvlggald wlkdnkastnk i lgf ? fts hglrlgveas lrs lar 
vaptqrhrytlvdmankvrptstf . 
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Figure . : Amino acid sequence of secreted DPPD

□ PPDPHQ?DMTKGYC?GG?/rtGFGDLAVCDG2KY?DGS 
G ? ? ? PGGCGGAI ?3 SQ ?NA? 
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